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导读 


© 李兵 


《 音 系 学 通 解 》 是 一 部 介绍 音 系 学 基础 理论 的 教科 书 。 这 部 教科 书 介绍 了 从 
以 SPE (The Sound Pattern of English) (Chomsky & Halle, 1968, 在 本 导读 中 以 后 提 及 
简称 SPE) 为 代表 的 经 典 生成 音 系 学 至 优选 论 (Prince & Smolensky, 1993; McCarthy 
& Prince, 1993) 为 止 几乎 所 有 重要 的 音 系 学 理论 ， 讨 论 了 音 系 学 里 许多 具有 重要 
理论 意义 的 经 典 问题 和 前 沿 领域 的 热点 问题 。 全 书 每 一 章 都 附 有 思考 题 和 练习 
题 ， 并 附 有 参考 答案 。 正 文 和 练习 题 涉 及 一 百 多 种 语言 的 材料 。 作 者 精心 安排 内 
容 ， 精 选 语言 材料 ， 在 二 百 多 页 的 篇 幅 里 以 简练 的 行文 解释 了 诸多 重点 问题 。 这 
部 教科 书 于 1998 年 出 版 ，2005 年 出 了 第 二 版 。 第 二 版 除了 纠正 了 第 一 版 里 的 印刷 
错误 之 外 ， 还 修改 、 增 删 了 少 部 分 内 容 。 第 二 版 增加 的 内 容 主要 包括 优选 论 对 音 
系 星 暗 现象 的 处 理 方法 和 对 句子 层面 上 音 系 结构 的 分 析 。 第 二 版 所 增加 的 内 容 反 
映 了 音 系 学 研究 的 最 新 进展 。 近 十 年 来 ， 笔 者 为 语言 学 专业 研究 生 讲授 音 系 学 基 
础 理论 ， 一 直 使 用 这 部 教科 书 ， 感 觉 该 书 在 教学 中 比较 实用 。 如 同 其 他 同类 教科 
书 一 样 ， 这 部 教科 书 也 有 自身 的 不 足 ， 例 如 对 某 些 重要 的 内 容 阐述 不 够 精细 ， 因 
此 学 生 需 要 阅读 相关 的 文献 以 弥补 不 足 ， 此 外 ， 每 章 的 练习 题 数 量 不 多 ， 因 此 需 
要 增添 练习 题 。 学 习 音 系 学 的 学 生 ， 特 别 是 初学 者 ， 必 须 做 大 量 的 练习 题 ， 因 为 
练习 题 对 学 生 掌 握 理 论 和 分 析 方 法 具有 非常 重要 的 作用 。 

生成 音 系 学 是 当代 音 系 学 的 主流 。 这 部 教科 书 介 绍 的 理论 、 基 本 概念 和 分 
析 方 法 主要 源 自生 成 音 系 学 的 研究 成 果 。 为 了 更 好 地 掌握 本 书 的 内 容 ， 有 必要 
对 生成 音 系 学 的 基本 理论 、 重 大 理论 、 建 树 和 发 展 趋势 作 扼要 的 介绍 。 


一 、 生 成 音 系 学 简介 


就 具体 研究 对 象 而 言 ， 生 成 音 系 学 在 不 同时 期 内 的 研究 重点 有 所 不 同 ， 在 
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分 析 方 法 方面 ， 不 同 流派 又 有 各 自 的 特点 。 但 是 ， 在 认识 论 、 理 论 目标 以 及 方 
法 论 方面 ， 生 成 音 系 学 始终 保持 其 特有 的 显著 特征 。 


1. 生成 音 系 学 的 认识 论 
要 了 解 生成 音 系 学 的 理论 ， 应 当 首先 了 解 这 种 理论 对 自然 语言 语音 的 认 


识 。 生 成 音 系 学 对 语音 的 认识 大 致 可 以 概括 为 如 下 几 点 : 语法 是 人 脑 所 具有 的 
语言 知识 ， 音 系 是 语言 知识 中 关于 语音 的 那 一 部 分 ， 语 音 数据 是 音 系 知 识 的 
外 部 表征 ， 人 类 语言 语音 系统 具有 共同 的 本 质 属性 ， 它 们 决定 了 可 能 的 音 系 结 
构 ， 排 除了 不 可 能 的 音 系 结构 ， 确 定 了 可 能 的 、 但 又 是 不 同 的 音 系 结构 的 差异 
范围 ， 儿 童 音 系 获得 的 基础 是 儿童 出 生 时 所 具有 的 处 于 初始 状态 的 作为 普遍 语 
法 一 部 分 的 音 系 共性 。 

根据 生成 音 系 学 奠基 人 Morris Halle 的 观点 ， 音 系 是 自然 语言 的 本 质 属性 。 
人 类 具有 包括 语言 在 内 的 许多 符号 系统 ， 所 有 的 符号 系统 都 有 句法 (符号 之 间 
的 关系 ) 和 语义 (符号 的 所 指 ) ， 但 是 只 有 语言 才 有 音 系 ， 音 系 是 把 语言 同 其 
他 所 有 符号 系统 区 分 开 的 最 重要 标志 。 


2. 生成 音 系 学 的 目标 

对 音 系 本 质 的 认识 决定 了 生成 音 系 学 的 理论 目标 : 揭示 人 脑 中 的 音 系 知 
识 ， 特 别 是 要 揭示 决定 人 类 语言 音 系 结构 及 其 变异 范围 和 作为 儿童 语音 发 展 基 
础 的 音 系 共性 。 

音 系 知识 是 抽象 的 ， 是 大 脑 某 种 层次 上 的 表征 ， 迄 今 为 止 人 们 尚 无 法 通过 
直接 观察 的 手段 去 认识 ， 因 此 ， 对 音 系 知 识 的 认识 只 能 通过 观察 到 的 音 系 外 部 
表征 〈 语 言 数据 ) 来 建立 某 种 假说 或 假设 ， 通 过 演绎 性 的 推论 得 出 初步 结论 ， 
然后 把 初步 结论 放 在 更 多 的 语言 数据 中 加 以 检验 证 实 。 音 系 知识 具有 模式 化 特 
点 ， 因 此 在 分 析 上 应 当 建 立 一 套 能 够 满足 模式 描写 需要 的 、 可 操作 的 、 具 有 形 
式 演绎 性 质 的 公理 系统 。 这 套 公理 系统 能 够 满足 描写 、 解 释 和 预测 自然 语言 
所 有 可 能 的 音 系 现象 的 需要 。 


3. 生成 音 系 学 的 方法 论 

生成 音 系 学 的 理论 目标 决定 了 它 的 方法 论 。 概 括 起 来 说 ， 它 的 方法 论 具 
有 以 下 特点 : (1) 强调 理论 的 核心 地 位 ， (2) 分 析 框 架 是 由 一 套 可 算 的 、 具 
有 形式 演绎 性 质 的 公理 系统 构成 的 ， 公 理 系统 的 特点 之 一 在 于 自足 性 ， 分 析 不 
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必 借 助 系统 以 外 的 任何 概念 或 手段 就 可 完成 (3) 元 语言 和 形式 化 表达 是 理 
论 表 述 的 主要 方式 ， (4) 强调 演绎 法 、 假 说 和 假设 在 理论 发 展 中 的 作用 。 诚 
然 ， 音 系 学 毕竟 是 一 门 经 验 科 学 ， 语 音 数据 在 理论 假设 的 产生 和 发 展 中 起 到 至 
关 重 要 的 作用 。 如 果 忽 视 语音 事实 的 存在 ， 音 系 学 将 失去 研究 对 象 ， 从 而 失去 
作为 一 门 科学 的 意义 。 音 系 学 和 语音 学 之 间 存 在 密切 的 关系 。 虽 然 音 系 学 和 语 
音 学 有 着 各 自 不 同 的 研究 对 象 和 理论 目标 ， 前 者 追求 的 是 音 系 事实 ( 音 系 语法 
知识 ) ， 后 者 注重 的 是 可 以 观察 到 的 语音 事实 ， 但 说 明 两 类 事实 之 间 的 关系 应 
当 是 音 系 学 和 语音 学 的 共同 任务 。 


二 、 生 成 音 系 学 的 发 展 


1. 经 典 生成 音 系 学 
经 典 生成 音 系 学 代表 作 是 SPE, 因此 经 典 生成 音 系 学 理论 通常 称 作 SPE 模 


式 。SPE 模 式 的 分 析 框 架 包 括 音 系 部 分 和 语音 部 分 。 音 系 部 分 主要 包括 语素 的 
HA (底层 ) 表达 和 语音 (表层 ) 表达 两 个 主要 层面 以 及 连接 两 个 层面 的 音 系 
规则 。 音 段 是 由 特征 矩阵 构成 的 ， 和 矩阵 内 的 特征 没有 结构 关系 。SPE 模 式 中 ， 
表达 是 线性 的 ， 即 所 有 的 成 分 出 现在 同一 音 层 上 。 重 音 、 声 调 等 超 音 段 特征 也 
被 视 作 音 段 成 分 。SPE 模 式 的 区 别 特征 系统 主要 包括 一 套 以 发 音 机 制 为 基础 的 
偶 值 特征 。 音 系 表达 中 的 语素 是 充分 赋值 的 。 由 于 表达 单位 是 语素 ， 因 此 ， 音 
位 和 音节 在 SPE 模 式 中 没有 理论 地 位 。 

音 系 规则 在 SPE 模 式 形式 系统 中 占据 核心 地 位 。 音 系 规则 以 特定 的 方式 作 
用 于 音 系 表达 ， 推 导出 语音 表达 。 在 有 些 情况 下 ， 从 音 系 表达 到 语音 表达 的 推 
导 过 程 涉及 若干 个 按 一 定 顺 序 应 用 的 音 系 规则 。 在 这 种 情况 下 ， 在 音 系 表达 和 
语音 表达 之 间 可 以 有 若干 个 中 间 层 面 。 作 为 音 系 部 分 的 输出 项 ， 语 音 表达 进入 
语音 部 分 ， 并 在 随后 的 语音 实施 规则 的 作用 下 获得 可 被 感知 的 语音 形式 。 

SPE 是 生成 音 系 学 的 黄 基 之 作 。 它 的 意义 在 于 系统 地 提出 了 生成 音 系 学 的 
认识 论 、 理 论 目标 和 方法 论 。SPE 模 式 的 重要 性 在 于 它 提 出 的 一 系列 假设 和 这 
些 假设 所 引发 的 问题 。SPE 以 后 的 音 系 学 理论 正 是 针对 这 些 假 设 和 问题 发 展 起 
来 的 。 

20 世 纪 60 年 代 末 和 70 年 代 初 ， 生成 音 系 学 的 讨论 主要 集中 在 规则 作用 方 
式 和 音 系 表达 抽象 程度 这 两 个 问题 上 。 讨 论 的 主要 结果 是 ， 分 析 人 允许 音 系 表 达 
有 共有 一 定 程度 的 抽象 ， 但 抽象 的 音 系 表 达 必 须 得 到 自然 的 语音 解释 , 从 而 减少 
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规则 和 中 间 层 面 的 数量 (参见 Kiparsky, 1968 ; Anderson, 1974 ; Kenstowicz 
& Kisseberth, 1977 ; Sommerstein, 1977). 


2. 词 库 音 系 学 
词 库 音 系 学 以 否定 SPE 中 词 库 无 内 部 结构 的 假设 为 突破 口 ， 建 立 了 一 个 


把 构 词 过 程 和 音 系 过 程 置 于 同一 模块 的 理论 体系 (Mohanan, 1982 ; Kiparsky 
1982，1985 ; Pulleyblank， 1986) 。 根 据 词 库 音 系 学 的 假设 ， 词 库 具 有 内 部 
结构 ， 由 若干 层面 构成 。 层 面 由 构 词 一 形态 的 属性 定义 。 根 据 是 否 与 构 词 一 形 
态 属性 相关 的 标准 ， 音 系 规则 分 为 词 库 规则 和 后 词 库 规则 ， 前 者 在 词 库 模 块 内 
应 用 ， 后 者 只 能 在 句法 部 分 之 后 的 后 词 库 模块 内 应 用 。 词 库 音 系 学 把 构 词 一 形 
态 学 引入 音 系 学 ， 不 仅 使 生成 音 系 学 体系 进一步 系统 化 和 模块 化 ， 而 且 带 动 了 
构 词 一 形态 的 深入 研究 。 虽 然 在 最 初 产生 时 词 库 音 系 学 是 一 种 比较 接近 SPE 模 
式 的 线性 理论 ， 但 它 在 其 后 的 发 展 过 程 中 吸收 了 非 线性 音 系 学 和 优选 论 中 的 假 
设 ， 使 自身 成 为 一 种 贯穿 生成 音 系 学 发 展 全 程 、 具 有 长 久生 命 力 和 较 强 应 用 价 
值 的 音 系 理论 之 一 。 


3. 非 线性 理论 

在 音 系 表达 方面 的 重大 进展 是 非 线 性 理论 。 非 线性 理论 以 SPE 模 式 音 系 结 
构 的 线性 表达 无 法 解释 特定 范畴 或 具有 特定 性 质 的 音 系 成 分 (例如 声调 ) 的 形式 
特点 这 一 缺陷 为 突破 口 , 提出 了 音 系 结构 的 非 线性 (或 多 音 层 ) 表 达 , 建立 了 以 自 
主音 段 音 系 学 和 节律 音 系 学 为 代表 的 非 线 性 理论 (Kahn, 1976 ; Goldsmith, 
1979、1990) 。 表 达 的 非 线性 化 导致 为 数 众多 的 音 系 规则 失去 存在 的 意义 ， 但 
是 没有 音 系 规则 又 无 法 说 明 底层 表达 是 如 何 被 推导 成 表层 表达 。 为 此 ， 非 线性 
理论 提出 了 使 用 联接 线 建 立 不 同音 层 上 各 种 音 系 成 分 之 间 的 联系 。 联 接 过 程 就 
是 音 系 表达 被 推导 为 语音 表达 的 过 程 。 


4. 不 充分 赋值 理论 
在 音 系 表达 研究 方面 取得 的 另 一 个 重要 进展 是 不 充分 赋值 理论 
(Archangeli， 1984) 。 在 SPE 模 式 中 , 音 系 表达 是 充分 赋值 的 ， 音 系 规则 的 作用 
在 于 改变 特征 值 。 随 后 的 研究 发 现 , 某 些 鲜 余 特 征 可 以 不 通过 规则 的 作用 ， 即 以 
缺 省 方式 ， 便 可 成 为 语音 表达 所 需要 的 语音 材料 。 据 此 ， 不 充分 赋值 理论 认为 ， 
音 系 表达 应 当 排 除 羡 余 特征 。 排 除 羡 余 特征 后 的 音 系 表达 仅仅 包括 语素 必要 的 音 
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系 信 息 ， 突 出 了 语素 的 音 系 结构 特点 ， 从 而 减少 所 涉及 音 系 规则 的 数量 。 


5。CV 音 系 学 
在 SPE 模 式 中 ， 音 系 规则 作用 域 是 语素 界线 符号 界定 的 音 段 序列 ， 音 节 不 


具有 结构 单位 的 地 位 ， 因 此 无 法 解释 以 音节 为 结构 单位 或 以 音节 为 作用 域 的 
音 系 过 程 。 非 线 性 理论 的 出 现 使 得 音节 作为 一 种 结构 单位 和 音 系 过 程 作用 域 
的 直觉 得 到 了 形式 化 的 表达 ， 产 生 了 以 音节 结构 为 主要 研究 对 象 的 CV 音 系 学 
(Clements & Keyser, 1983)。 


6. 特征 几何 理论 

在 SPE 模 式 中 ， 音 段 是 一 个 没有 内 部 结构 的 特征 集 。 这 种 假设 在 分 析 某 些 
涉及 若干 个 特征 同时 扩散 的 音 系 过 程 时 遇 到 了 困难 。 对 若干 个 特征 同时 扩散 
现象 的 研究 导致 了 以 音 段 结构 为 研究 对 象 的 特征 几何 理论 的 产生 (Clements, 
1985; Sagey, 1986) 。 在 特征 几何 模型 中 ， 语 音 特 征 通过 若干 个 属于 上 位 节 
点 的 特征 群 的 方式 组 织 起 来 。 特征 几何 的 形式 是 树 形 结构 。 树 的 顶点 是 一 个 代 
表 整 个 音 段 的 根 节点 ， 根 节点 制约 着 若干 个 代表 高 层次 特征 的 次 位 节点 ， 这些 
次 位 节点 又 分 别 制约 着 一 群 终端 成 分 。 所 有 的 特征 以 分 枝 树 形 的 方式 相互 联系 
着 , 每 一 个 特征 在 树 形 结构 中 都 有 各 自 位 置 。 这 种 方法 使 所 有 的 特征 结合 起 来 
构成 一 个 完整 的 音 段 结构 体 。 若 干 个 特征 同时 扩散 是 次 位 节点 扩散 的 结果 。 


7. 韵律 音 系 学 

在 SPE 模 式 里 ， 音 系 规则 的 作用 域 是 由 语素 界线 符号 界定 的 。 然 而 , 人 们 观 
察 到 某 些 音 系 过 程 的 作用 域 可 以 大 于 词 。 葛 律 音 系 学 (Selkirk, 1972) 提出 了 音 
系 过 程 的 作用 域 是 由 句 段 的 韵律 结构 决定 的 假设 ， 把 对 音 系 过 程 的 分 析 置 于 比 词 
更 大 的 范围 之 中 ， 从 而 使 对 音 系 过 程 的 分 析 同 句法 范畴 和 句法 结构 联系 起 来 。 


8. 优选 论 ! 

优选 论 (McCarthy & Prince, 1993 ; Prince & Smolensky， 1993) 是 生 
成 音 系 学 最 重要 的 理论 发 展 。 优 选 论 认 为 ， 普 遍 语法 既 不 体现 为 音 系 表达 的 某 
种 形式 ， 也 不 体现 为 音 系 规则 ， 而 是 体现 在 制约 条 件 对 语音 表达 的 限制 上 。 普 
遍 语法 是 由 制约 条 件 组 成 的 ， 每 一 种 具体 语言 包含 一 组 具有 普遍 意义 的 制约 条 
件 。 制 约 条 件 的 基本 特点 之 一 是 普遍 性 。 语 言 之 间 的 差别 在 于 制约 条 件 在 具体 
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语法 中 层级 排列 的 不 同 。 语 音 表达 多 样 性 是 制约 条 件 相互 冲突 并 通过 制约 条 件 
的 层级 排列 解决 冲突 的 结果 。 制 约 条 件 另 一 个 基本 特点 是 可 违反 性 。 优 选 论 多 
许 符合 语法 的 语音 表达 违反 制约 条 件 。 任 何 一 个 制约 条 件 都 是 可 以 违反 的 , 但 
是 符合 语法 的 形式 应 当 最 大 限度 地 满足 制约 条 件 层级 体系 提出 的 对 各 种 范畴 的 
要 求 或 限制 。 

优选 论 建立 了 一 套 简单 但 严密 的 分 析 框架 。 如 下 图 所 示 : 


— | 装置 | 一 | 候选 而， 
ries 候选 项 。 


生成 装置 Gen 是 普遍 语法 的 组 成 部 分 ， 固定 在 每 一 种 语言 里 。 它 的 作用 是 
为 特定 的 输入 项 (底层 表达 ) 制造 在 数量 上 无 限 的 候选 项 。 所 有 的 候选 项 必须 经 
过 同一 制约 条 件 层 级 体系 Con 的 评估 。 经 过 评估 ， 最 大 限度 地 满足 制约 条 件 层 
级 体系 的 候选 项 被 确定 为 优选 项 ， 即 符合 语法 的 语音 表达 形式 。 从 底层 表达 到 
表层 表达 是 通过 一 次 映射 完成 的 。 在 优选 论 中 ， 音 系 规则 和 中 间 层 次 是 不 存在 
的 。 

优选 论 首先 产生 于 音 系 领域 的 研究 ， 随 后 优选 论 的 原理 在 名 法、 儿童 语言 
获得 、 语 言 演变 和 语 用 等 领域 得 到 广泛 的 应 用 。 

随 着 优选 论 的 出 现 ， 生 成 音 系 学 在 对 音 系 本 质 的 理解 方面 产生 了 重大 分 
层 ， 出 现 了 两 种 对 立 的 假设 : 规则 推导 论 和 制约 条 件 论 。 两 种 假设 对 立 的 焦点 
在 于 是 否 承认 推导 过 程 的 存在 。 规 则 推导 论 坚 持 传统 生成 音 系 学 的 假设 ， 认 为 
音 系 的 实质 性 内 容 主要 是 音 系 规则 和 根据 规则 推导 出 来 的 中 间 层 次 。 推 导论 不 
否认 制约 条 件 的 存在 ， 认 为 制约 条 件 在 一 定 情况 下 对 推导 过 程 和 音 段 配 列 起 到 
限制 作用 。 制 约 条 件 论 则 完全 和 否定 音 系 规则 和 中 间 层 次 的 存在 ， 仅 仅 依 靠 制 约 
条 件 的 层级 排列 解释 表层 形式 上 的 差异 。 

由 于 否定 中 间 表 达 层 次 的 存在 ， 优 选 论 面临 一 系列 需要 解决 的 问题 ， 其 中 
最 主要 的 是 音 系 睡 暗 现 象 。 

在 规则 推导 论 里 ， 音 系 星 暗 源 于 音 系 规则 的 反馈 给 顺序 和 反 阻 断 顺 序 。 星 
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暗 现 象 的 两 种 基本 形式 是 表层 非 显 和 表层 非 真 。 概 括 起 来 说 ， 音 系 星 暗 现象 是 
顺序 在 先 的 规则 应 用 被 顺序 在 后 的 规则 的 应 用 所 掩盖 的 结果 ， 上 顺序 在 先 的 规则 
应 用 的 过 程 在 表层 上 没有 得 到 反映 (表层 非 显 ) 或 者 没有 得 到 真实 的 反映 (K 
层 非 真 ) 。 

音 系 临 暗 现象 是 不 同 规则 按 顺 序 应 用 的 逻辑 必然 ， 在 两 条 规则 的 应 用 过 程 
之 间 必 然 存在 一 个 中 间 表 达 层 次 。 但 是 ， 因 为 优选 论 分 析 框 架 内 不 存在 中 间 表 
达 层 次 ， 无 法 再 现 中 间 层 次 上 的 音 系 表达 ， 所 以 制约 条 件 不 能 识别 和 判断 含有 
星 暗 性 的 输出 项 。 或 者 说 ， 优 选 论 的 静态 的 制约 条 件 关 系 体系 无 法 以 形式 化 方 
式 再 现 规 则 推导 论 的 动态 推导 过 程 中 反馈 给 关系 和 反 阻 断 关 系 表现 出 的 清晰 的 
推导 步骤 和 确定 的 推导 结果 。 

TF AMM RR ET IEE PEK. ERTL VA AIEEE SF 
过 程 中 产生 的 现象 ， 优 选 论 作出 多 种 尝试 ， 试 图 通过 某 种 形式 化 方式 再 现 睡 暗 
现象 ， 从 而 使 0T 成 为 一 种 真正 具有 概括 力 、 解 释 力 和 预测 力 的 语言 理论 。 优 
选 论 的 这 些 尝 试 主要 包括 有 限 和 谐 序 列 论 、 多 语法 假设 、 层 面 优选 论 、 对 应 理 
论 、 共 感 论 、 比 较 标 记 理论 和 候选 项 链 等 理论 。 关 于 优选 论处 理 音 系 临 暗 现象 
的 各 种 方法 的 介绍 ， 读 者 可 以 参考 李兵 (2005) MARR (2008) 。 

近年 来 ， 优 选 论 呈 现 出 功能 主义 倾向 ， 其 特点 是 物理 、 生 理 、 行 为 等 非 语 
法 因素 被 看 作 是 构建 制约 条 件 系统 的 语言 学 基础 。 功 能 主义 的 优选 论 受到 了 形 
式 主 义 语言 学 的 尖锐 批评 (参见 Hale & Reiss, 2000; 李兵 ，2008) 。 


三 、 对 《 音 系 学 通 解 》 的 介 评 


第 一 章 ”语音 的 产生 

本 章 介 绍 音 系 学 的 语音 学 基础 ， 从 发 音 机 制 ， 发 声 类 型 和 气流 机 制 等 方面 
简明 扼要 地 介绍 了 语音 产生 的 原理 。 已 经 修 过 普通 语音 学 的 读者 可 以 跳 过 这 一 
章 。 本 章 对 声调 产生 的 语音 学 基础 介绍 过 于 简单 ， 对 声调 和 变调 现象 感 兴趣 的 
读者 可 以 读 Tone: A Linguistic Survey (Fromkin， 1978) 一 书 中 的 有 关 章 节 ， 以 便 
进一步 了 解 声调 产生 的 语音 学 原理 。 


第 二 章 RMA: 同一 与 差异 
同形 态 一 句法 结构 一 样 ， 音 系 结构 具有 层次 性 和 递归 性 特点 。 音 系 学 研究 
的 三 个 主要 方面 是 (1) 对 比 ， (2) 音 段 配 列 ， (3) 交替 (参见 Goldsmith, 
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1995) 。 从 类 型 学 的 角度 看 ， 虽 然 不 同 的 语音 系统 在 音 段 系统 、 音 段 配 列 和 交 
蔡 方 面具 有 差异 ， 但 也 表现 出 许多 共性 。 音 系 共性 和 差异 之 间 在 一 定 程度 上 能 
够 通过 蕴涵 关系 联系 起 来 。 音 系 学 的 主要 任务 在 于 确定 音 系 共性 以 及 具体 音 系 
的 变异 范围 。 从 音 系 学 的 角度 看 ， 儿 童 音 系 获得 的 实质 是 一 个 在 某 种 先天 的 具 
有 普遍 意义 的 音 系 结构 的 基础 上 增添 具体 语言 音 系 信息 的 过 程 。 需 要 指出 的 
是 ， 本 书 作 者 在 这 里 所 说 的 音 系 共性 是 类 型 学 意义 上 的 共性 ， 虽 然 可 以 作为 建 
立 关 于 先天 的 普遍 音 系 结构 假设 的 基础 ， 但 二 者 并 不 相同 : 前 者 是 音 系 结构 的 
外 部 体现 ， 属 于 语言 数据 范畴 ， 后 者 是 人 脑 固有 的 某 种 属性 ， 属 于 语言 知识 范 
WF 


第 三 章 ”符合 语法 的 形式 从 何 而 来 

外 来 形式 的 本 族 化 能 够 充分 反映 不 同音 系 之 间 的 异同 。 本 族 化 过 程 分 成 两 
个 阶段 : 感知 阶段 和 操作 阶段 。 在 感知 阶段 ， 本 族 语 使 用 者 根据 其 本 族 语音 段 
系统 解释 所 感知 的 外 来 形式 ， 在 操作 阶段 ， 本 族 语 使 用 者 根据 本 族 语 的 特点 来 
调整 他 所 解释 的 外 来 形式 。 

那么 ， 本 族 语 使 用 者 根据 什么 对 外 来 形式 进行 调整 呢 ? 本 书 分 别 介绍 了 传 
统 生成 音 系 学 的 推导 论 和 优选 论 的 制约 条 件 论 。 根 据 前 者 ， 本 族 语 使 用 者 对 外 
来 形式 调整 的 依据 是 本 族 语 的 音 系 规则 ， 调 整 过 程 是 运用 一 系列 音 系 规则 按 一 
定 的 顺序 把 其 所 解释 的 外 来 形式 (可 以 理解 为 词 项 的 底层 表达 ) 推导 成 符合 
本 族 语 语 法 的 表层 形式 。 

根据 后 者 ， 调 整 过 程 是 根据 本 族 语 的 制约 条 件 层级 排列 对 与 底层 表达 有 关 
的 不 同 表层 形式 进行 评估 和 选择 的 过 程 。 符 合 语法 的 表层 形式 是 两 种 范畴 制约 
条 件 相互 作用 的 结果 。 一 种 范畴 叫 作 忠实 性 ， 另 一 范畴 叫 作 无 标记 性 。 每 个 范 
畴 包括 一 组 具有 普遍 意义 的 制约 条 件 。 属 于 忠实 性 范畴 的 制约 条 件 要 求 输出 项 
和 输入 项 相同 。 属 于 无 标记 性 范畴 的 制约 条 件 要 求 输出 项 是 自然 语言 中 普遍 和 
自然 的 形式 。 从 逻辑 上 讲 ， 如 果 忠 实 性 范畴 制约 条 件 的 层级 排列 高 于 无 标记 性 
范畴 制约 条 件 ， 调 整 将 不 可 能 ， 如 果 无 标记 性 范畴 制约 条 件 的 层级 排列 高 于 忠 
实 性 范畴 制约 条 件 ， 那 么 一 种 语言 的 词 项 表层 形式 将 是 相同 的 。 这 两 种 逻辑 上 
的 可 能 确定 了 语言 之 间 的 变异 范围 。 为 了 解释 不 同音 系 之 间 的 差异 ， 优 选 论 认 
为 ， 属 于 两 种 范畴 的 具体 制约 条 件 的 层级 排列 是 交错 的 ， 特 别 是 允许 无 标记 性 
制约 条 件 通过 不 同 的 层级 排列 而 交互 作用 。 在 无 标记 性 问题 上 ， 优 选 论 和 布 拉 
格 学 派 有 所 不 同 。 在 布拉格 学 派 的 标记 理论 中 ， 标 记性 是 单一 结构 层次 上 的 ， 
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在 优选 论 中 ， 表 层 形式 是 若干 属于 不 同 结构 层次 的 无 标记 性 制约 条 件 交 互 作用 
的 结果 ,符合 语法 的 表层 未 必 是 无 标记 形式 。 (参见 McCarthy, 2002) 

传统 生成 音 系 学 的 推导 论 和 优选 论 都 是 值得 研究 的 理论 。 两 种 理论 有 着 各 
自 的 特点 和 不 足 。 推 导论 的 主要 问题 在 于 在 某 些 情况 下 推导 过 程 过 于 复杂 ， 而 
且 某 些 相关 的 音 系 规则 缺少 概括 性 。 优 选 论 目前 仍然 是 一 种 发 展 着 的 理论 ， 目 
前 遇 到 的 主要 困难 是 由 于 否定 推导 过 程 和 中 间 层 次 而 带 来 的 音 系 星 瞳 问题 。 音 
系 星 上 暗 主 要 包括 两 种 情况 : 一 种 是 表层 形式 非 真 ， 另 一 种 是 表层 形式 非 显 。 针 
对 音 系 星 暗 问题 ， 优 选 论 提出 多 种 解决 问题 的 方法 ， 其 中 主要 包括 对 应 理论 、 
共 感 理论 和 局 部 制约 条 件 复 合体 。 


第 四 章 ”底层 表达 和 表层 表达 
从 分 析 的 角度 看 ， 生 成 音 系 学 区 分 词 项 表达 的 两 个 层面 : 底层 表达 和 表层 


表达 。 词 项 底层 表达 仅仅 包括 无 法 预测 的 音 系 信息 ， 具 有 一 定 程 度 的 抽象 性 。 
词 项 的 表层 表达 是 一 种 具有 全 部 或 大 多 数 语音 信息 的 、 能 够 作为 该 结构 单位 语 
音 现 实 基 础 的 的 形式 。 生 成 音 系 学 区 分 两 个 表达 层面 是 基于 以 下 三 方面 的 考 
虑 。 

首先 ， 从 词 项 的 心理 现实 角度 看 ， 每 个 词 项 必须 以 某 种 简明 的 和 具有 显 
著 特 征 的 形式 储存 在 记忆 之 中 ， 一 方面 使 记忆 单元 能 够 储存 更 多 的 词 项 ， 另 一 
方面 在 交际 时 能 够 以 最 快 的 速度 检索 和 提取 词 项 以 满足 编码 和 解码 的 需要 。 这 
种 形式 就 是 词 项 的 底层 表达 形式 ， 它 以 某 种 显著 的 音 系 信息 和 其 表层 形式 对 应 
着 ,同时 又 排除 了 过 多 地 占据 记忆 空间 和 影响 检索 及 提取 词 项 速度 的 羔 余 语音 
信息 (参见 Bromberger & Halle, 1989) 。 然 而 ， 大 脑 神经 学 的 研究 表明 ， 对 
于 人 脑 的 巨大 记忆 空间 来 说 ， 词 项 鲜 余 语音 信息 并 不 会 导致 记忆 障碍 。 因 此 ， 
记忆 的 词 项 储存 能 力 本 身 并 不 能 作为 底层 表达 存在 的 有 力 证 据 (Kenstowicz, 
1994) 。 其 次 ， 自 然 语言 中 普遍 存在 交替 现象 ， 即 同一 语素 在 不 同音 系 条 件 下 
以 不 同 的 语音 形式 交替 出 现 。 如 果 没有 底层 表达 层面 ， 每 一 个 语素 变 体 将 不 得 
不 以 独立 词 项 的 身份 出 现在 词 库 里 。 语 素 底层 形式 的 建立 不 仅 减 少 了 词 库 中 词 
项 的 数量 ， 更 重要 的 是 ， 能 够 通过 语素 底层 形式 和 音 系 规则 说 明 底层 形式 和 不 
同 语素 变 体 之 间 的 关系 ， 从 而 使 描写 更 具 概括 性 和 解释 力 。 最 后 ， 跨 语言 研究 
表明 ， 鲜 余 特 征 在 音 系 过 程 中 往往 具有 惰性 特点 ， 建 立 底层 表达 并 且 把 鲜 余 特 
征 排除 在 底层 表达 之 外 为 理解 其 羡 余 性 和 惰性 提供 了 形式 基础 。 

虽然 优选 论 通 过 制约 条 件 实现 对 表层 表达 层面 上 符合 语法 形式 的 选择 和 解 
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释 ， 但 是 底层 表达 在 优选 论 的 分 析 框架 中 仍然 是 不 可 缺少 的 。 这 是 因为 ， 仅 就 忠 
实 性 范畴 制约 条 件 而 言 ， 表 层 形式 是 否 符合 语法 必须 通过 和 其 底层 形式 的 对 比 才 
能 确定 。 此 外 ， 儿 童 语言 获得 研究 为 底层 表达 存在 的 必要 性 提供 了 独立 的 证 据 。 


第 五 章 ”区别 性 特征 
区 别 性 特征 是 音 段 的 基本 成 分 ， 也 是 音 系 结构 表达 的 最 小 单位 。 自 然 语言 


的 音 段 系统 是 由 一 套数 量 有 限 的 特征 构成 的 ， 因此， 特征 的 基本 特点 之 一 是 普 
遍 性 。 特 征 概 念 具 有 以 下 作用 : (1) 通过 特征 差别 说 明 音 段 差别 ， 关 于 音 段 
对 立 关 系 的 直觉 得 到 形式 化 表述 ; (2) 通过 特征 的 异同 说 明 音 段 自然 类 之 间 
的 关系 ; (3) 表面 上 似乎 相互 无 关 的 音 系 过 程 可 以 通过 特征 联系 起 来 ， 从 而 
得 到 概括 和 解释 。 

关于 特征 系统 ， 目 前 有 争议 的 主要 问题 有 以 下 几 个 : 

一 、 特 征 的 语音 学 基础 问题 。 特 征 必 须 有 与 其 对 应 的 语音 学 基础 。 对 此 ， 
分 歧 不 大 。 然 而 ， 究 竟 以 音 段 的 发 音 生 理 机 制 ， 还 是 以 音 段 的 声学 特性 或 感知 
特性 为 基础 ， 不 同 研究 者 有 着 不 同 观点 。 本 书 介绍 的 是 以 发 音 生 理 机 制 为 基础 
的 特征 系统 。 

二 、 特 征 值 问题 。 在 这 个 问题 上 ， 大 体 上 有 三 种 观点 : (1) 特征 是 偶 值 
的 ; (2) 特征 是 独 值 的 ，(3) 特征 是 多 值 的 。 本 书 采用 的 特征 大 多 为 偶 值 特 
征 ， 少 数 特征 是 独 值 的 。 

三 、 特 征 范畴 问题 。 作 为 一 部 音 系 学 的 教科 书 ， 本 书 在 这 个 问题 上 综合 了 
SPE 以 及 其 后 多 家 观点 。 关 于 特征 范畴 ， 分 歧 主 要 来 自 三 个 方面 的 考虑 : (1) 
如 果 坚 持 特征 的 语音 学 基础 是 发 音 生理 机 制 的 话 ， 那 么 若干 个 在 发 音 生 理 机 制 
方面 有 联系 的 语音 特点 当中 ， 究 竟 哪 一 个 是 具有 音 系 特征 的 基础 ， 而 其 他 的 则 
可 视 作 附带 生理 现象 的 结果 ? 例如 [tense], [ATR] 和 [RTR] 三 个 特征 之 间 的 关系 
就 是 一 个 典型 的 例子 。 但 是 ， 本 书 把 与 某 些 日 尔 曼 语言 元 音 有 关 但 争议 甚大 的 
[tense] 同 阿 坎 语 元 音 的 [ATR] 以 及 某 些 通古斯 语言 元 音 的 [RTR] 进 行 比较 似乎 
从 妥 。 其 主要 原因 是 ， 日 尔 曼 语言 中 的 [tense] 和 阿 坎 语 [ATR] 以 及 通古斯 语言 
[RTR] 有 着 不 尽 相 同 的 语音 内 容 。 (2) 特征 范畴 同 特征 值 有 关 。 例 如 在 偶 值 特 
征 系统 中 ，[ATR] 和 [RTR] 可 以 归 为 同一 范畴 ， 但 是 在 独 值 特征 系统 中 ，[ATR] 
和 [RTR] 却 属于 两 个 不 同 的 范畴 。(3) 更 加 重要 的 是 ， 具 体 某 个 特征 在 特征 几何 
中 能 否 有 其 恰当 的 结构 位 置 ， 而 且 是 否 能 够 以 此 揭示 音 段 的 对 立 关 系 以 及 相关 
音 系 过 程 的 机 制 。 
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第 六 章 ”规则 的 顺序 
本 章 的 主要 内 容 是 介绍 规则 推导 论 的 音 系 规则 的 应 用 方式 ， 重 点 介绍 别 


处 条 件 (elsewhere condition)、 同 时 规则 顺序 (simultaneous rule ordering) 、 规 
则 的 单一 线性 顺序 (linear rule ordering) 、 规 则 应 用 的 内 在 顺序 (intrinsic rule 
ordering) 、 规 则 应 用 的 外 在 顺序 (extrinsic rule ordering) 等 规则 应 用 方式 以 及 
相关 的 原则 和 概念 ， 演 示 规 则 之 间 的 馈 给 (feeding) 顺序 、 阻 断 (bleeding) 
顺序 、 反 馈 给 (counter-feeding) 和 反 阻 断 (counter-bleeding) 关系 。 

本 章 还 介绍 了 音 系 睡 瞳 现 象 和 两 种 以 制约 条 件 为 基础 的 处 理 方法 : 把 词 库 
音 系 学 和 优选 论 结合 起 来 的 多 语法 假设 分 析 法 和 优选 论 的 共 感 论 分 析 法 。 多 语 
法 假设 坚持 词 库 音 系 学 提出 的 词 库 内 有 多 个 层面 的 假设 ,通过 中 间 层 面 使 星 瞳 
形式 得 以 表达 。 和 词 库 音 系 学 不 同 的 是 ， 多 语法 假设 用 制约 条 件 取 代 了 词 库 音 
系 规则 ， 在 每 一 个 层面 上 都 有 一 个 制约 条 件 层级 排列 系统 。 多 语法 假设 存在 理 
论 和 技术 上 的 问题 。 在 理论 上 ， 一 个 制约 条 件 层级 排列 系统 是 一 种 语法 ， 而 多 
个 层面 则 需要 多 个 制约 条 件 层级 排列 系统 。 据 此 ， 一 种 语言 是 由 多 个 不 同 的 语 
法 系统 构成 的 。 多 语法 假设 的 这 一 理论 后 果 未 能 得 到 广泛 的 接受 。 其 次 ， 在 词 
库 音 系 学 里 ， 层 面 是 由 语素 的 范畴 和 性 质 定义 的 ， 特 定 层面 上 音 系 规则 的 应 用 
受制 于 语素 的 范畴 和 性 质 。 但 是 在 多 语法 假设 里 ， 层 面 是 由 制约 条 件 定义 的 ， 
而 制约 条 件 自 身 与 语素 范畴 和 性 质 无 关 ， 因 此 ， 层 面 的 构建 缺少 理据 。 

共 感 论 试图 通过 同一 输入 项 的 不 同 候选 项 之 间 建 立 一 个 对 应 关系 网 络 再 现 
规则 推导 的 动态 过 程 。 共 感 论 的 基本 方法 是 ， 在 输出 项 集合 里 选择 一 个 被 称 之 
为 “ 共 感 项 ”(sympathetic candidate) 的 候选 项 。 共 感 项 是 能 够 再 现 中 间 层 次 的 
星 暗 形式 ， 其 作用 在 于 把 优选 项 和 输入 项 联系 起 来 。 在 具体 分 析 中 ， 关 键 的 步 
又 一 是 确定 共 感 项 ， 二 是 确定 共 感 项 、 优 选项 、 和 输入 项 之 间 的 对 应 关系 。 确 
定 共 感 项 的 唯一 依据 是 制约 条 件 体系 里 的 某 个 相关 的 而 且 层级 排列 较 低 的 叫 作 
“选择 器 ”(selector) 的 忠实 类 制约 条 件 。 传 统 优选 论 的 忠实 类 制约 条 件 是 一 条 
连接 底层 表达 和 表层 表达 的 直接 通道 ， 那 么 ， 根 据 I-O 对 应 的 忠实 类 制约 条 件 
所 确定 的 共 感 项 则 是 一 条 连接 两 个 表达 层次 的 间接 通道 。 共 感 项 之 所 以 被 选 作 
中 介 形 式 ， 其 原因 正 是 它 一 方面 满足 了 特定 的 LO 对 应 忠实 类 制约 条 件 ， 而 另 
一 方面 它 又 势必 在 形式 特点 方面 与 优选 项 相似 ， 所 以 ， 通 过 共 感 项 ， 源 于 中 间 
表达 层次 上 的 上 星 暗 音 系 表达 形式 能 够 以 符合 语法 的 表层 形式 成 为 优选 项 。 


xviii 导读 

共 感 论 是 为 解决 星 暗 性 问题 做 出 的 最 有 力 尝 试 。 然 而 ， 就 其 解决 问题 范围 
和 分 析 机 制 来 说 ， 共 感 论 仍然 存在 着 一 些 问 题 。 首 先 ， 对 具体 语言 现象 的 分 析 
可 以 看 出 ， 共 感 论 虽然 能 够 解释 表层 非 真 的 星 暗 ， 但 是 仍然 无 法 说 明 表 层 非 显 
的 星 暗 。 其 次 ， 选 择 器 的 性 质 和 来 源 不 十 分 清楚 。 在 有 若干 个 层级 条 件 排列 较 
低 的 LO 对 应 忠实 类 制约 条 件 的 情况 下 ， 对 选择 器 的 选择 具有 不 确定 性 。 作 为 
形式 语言 学 的 优选 论 来 说 ， 不 能 确定 选择 器 和 共 感 项 形式 之 间 的 逻辑 关系 不 仅 
仅 是 技术 层面 上 的 问题 ， 更 重要 的 是 ， 从 语法 理论 基础 角度 看 ， 这 涉及 选择 器 
gH HE REM X. 


第 七 章 PRUE: 荷兰 语 中 的 指 小 后 绥 

本 章 以 荷兰 语 标准 方言 和 锡 塔 德 方言 (Sittard) 为 语言 材料 进一步 讨论 音 系 
规则 应 用 顺序 和 相互 作用 方式 。 在 规则 推导 论 中 ， 尽 管 规则 有 序 性 和 推导 的 中 
间 层次 并 非 必须 表层 显 真 ， 但 对 每 一 个 音 系 过 程 所 做 的 描写 是 清晰 的 。 


第 八 章 ”表达 的 层面 

本 章 重点 从 词 库 音 系 学 的 角度 讨论 推导 过 程 中 间 层 面 的 表达 问题 。 词 库 
由 若干 个 层面 构成 ， 音 系 规则 分 为 词 库 规 则 和 后 词 库 规 则 是 词 库 音 系 学 中 的 基 
本 假设 。 在 SPE 模 式 中 ， 音 系 规则 只 能 在 句法 部 分 之 后 的 音 系 部 分 里 应 用 。 然 
而 ， 此 后 的 研究 表明 ， 某 些 音 系 规 则 的 作用 域 往往 是 由 语素 范畴 或 语素 界线 决 
定 的 ， 同 构 词 过 程 有 关 ， 而 另外 一 些 音 系 规则 却 与 构 词 过 程 无 关 。 这 就 是 说 ， 
前 者 是 在 词 库 中 依照 词 项 的 语素 信息 发 生 作 用 。 这 样 的 音 系 规则 称 作 词 库 规 
则 。 与 构 词 过 程 无 关 的 音 系 规则 称 作 后 词 库 规则 。 

不 同 范畴 的 语素 提供 不 同 的 信息 ， 而 且 不 同 范畴 的 语素 在 构 词 过 程 中 
出 现 的 顺序 不 同 。 同 一 范畴 的 语素 构成 特定 词汇 规则 的 作用 域 ， 称 做 层面 
Gstratum)。 同 构 词 过 程 有 关 的 不 同 语素 范畴 构成 若干 个 层面 。 概 括 起 来 讲 ， 
构 词 过 程 和 ( 词 库 和 后 词 库 ) 音 系 规则 可 以 用 下 图 表示 ( 引 自 Pulleyblank， 
1986 : 5-6) : 
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LEXICON 


ed lexical 
est PHONOLOGY 


rule 1 (domain: strata i, j ...) 


rule 2 (domain: strata k, ... ) 


tule m (domain: strata p, q, r ... ) 


Post Lexical Rules 


严格 区 分 词 库 规则 和 后 词 库 规 则 是 词 库 音 系 学 的 基本 原则 。 集 各 家 之 说 ， 
二 者 的 主要 差别 表现 在 以 下 方面 (本 书 涉 及 了 其 中 六 个 方面 ) : 


词 库 规则 后 词 库 规则 

(1) 需要 参照 词 项 内 部 的 语素 信息 不 需要 参照 词 项 内 部 的 语素 信息 
(2) 作用 域 不 能 大 于 词 界 作用 域 可 以 大 于 词 界 
(3) 可 以 循环 应 用 不 能 循环 应 用 
(4) 如 果 循环 应 用 ， 则 必须 遵循 严格 非 循环 性 的 ， 因 此 是 按 特定 顺序 

循环 条 件 一 次 性 应 用 
(5) 应 用 受到 结构 维持 条 件 的 限制 应 用 不 受到 结构 维持 条 件 的 限制 
(6) 允许 有 例外 没有 例外 


(7) 语言 直觉 可 以 对 输出 项 做 出 判断 语言 直觉 难以 对 规则 输出 项 做 出 判断 

(8) 必须 在 所 有 的 后 词 库 规 则 之 前 应 用 必须 在 所 有 的 词 库 规则 之 后 应 用 
然而 ， 词 库 音 系 学 的 原理 一 直 是 有 争议 的 。 争 论 的 主要 问题 一 是 层面 的 

顺序 和 相互 之 间 的 关系 ， 二 是 词 库 规则 和 后 词 库 规则 各 自 的 特点 。 就 层面 顺序 
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看 ， 除 了 语素 类 别 之 外 ， 词 内 语素 之 间 的 语义 关系 对 层面 顺序 也 起 着 决定 作 
用 ， 在 层面 之 间 相 互 关 系 方面 ， 推 导 过 程 中 顺序 在 先 的 层面 有 可 能 改变 语素 的 
音 系 结构 。 在 词 库 规则 和 后 词 库 规 则 各 自 特 点 方面 ， 词 库 规 则 有 可 能 是 非 循环 
的 ， 而 某 些 后 词 库 规则 却 表现 出 循环 性 特点 。 在 某 些 情况 下 ， 词 汇 规则 明显 地 
违反 结构 维持 条 件 。 此 外 ， 有 证 据 表明 ， 后 词 库 规则 也 允许 有 例外 。 关 于 比较 
详细 的 讨论 ， 读 者 可 以 参看 Kenstowicz (1994) 和 Roca (1994)。 

词汇 音 系 学 是 最 接近 SPE 模 式 的 理论 。 然 而 ;在 否定 推导 过 程 的 优选 论 看 
来 ,层面 概念 仍然 具有 一 定 的 潜在 的 理论 意义 。 在 试图 解决 音 系 星 暗 问题 时 ， 
有 些 研究 者 提出 了 人 允许 一 种 语言 有 两 部 或 多 部 语法 的 观点 ， 即 同一 组 制约 条 件 
在 不 同 层 面 上 有 不 同 的 层级 排列 。 


第 九 章 ”声调 的 表达 

本 书 通 过 对 声调 的 主要 形式 特点 “稳定 性 ”的 讨论 引进 了 自主 音 段 音 系 学 理 
论 。 在 SPE 模 式 中 ， 由 于 (底层 和 表层 ) 表达 是 线性 的 ， 声 调 被 视 作 元 音 的 音 
段 特征 ， 因 此 ， 无 法 解释 声调 的 稳定 性 。 此 外 ， 对 于 某 些 作用 域 大 于 音 段 的 音 
系 特征 (例如 和 谐 特征 ) ， 也 只 能 通过 重复 使 用 同一 规则 的 方法 完成 从 底层 表 
达到 表层 表达 的 推导 。 总 之 ， 线 性 表达 概念 一 方面 无 助 于 对 形式 特点 的 解释 ， 
另 一 方面 导致 推导 过 程 复杂 化 。 

在 非 线 性 的 自主 音 段 音 系 理论 中 ， (底层 和 表层 ) 表达 由 若干 个 音 层 构 
Mm, 不同 范畴 的 音 系 特征 置 于 不 同音 层 上 ， 同 一 范畴 的 音 系 特征 置 于 同一 音 
层 上 。 因 此 非 线性 表达 又 称 作 多 音 层 表达 。 能 够 单独 出 现在 某 个 音 层 上 的 音 
系 成 分 具有 自主 音 段 的 地 位 。 根 据 “ 连 接 规约 "和 “禁止 连接 线 交 叉 制 约 条 件 ”， 
推导 过 程 把 不 同音 层 上 的 音 系 成 分 连接 起 来 。 连 接 说 明 推 导 过 程 是 如 何 把 不 
同 范畴 的 音 系 成 分 组 织 起 来 构成 符合 语法 的 语素 表层 形式 。 声 调 的 稳定 性 通 
过 连接 规约 得 到 解释 ， 自 主音 段 的 映射 或 扩散 使 其 作用 域内 的 语素 实现 表层 
表达 。 

自主 音 段 音 系 学 主要 依靠 非 线 性 表达 实现 对 形式 特点 解释 ， 因 此 需要 对 
表达 (特别 是 底层 表达 ) 的 形式 给 予 必要 的 限制 。 最 主要 的 限制 性 条 件 是 强制 
性 非 同 值 原则 (OCP) 。 根 据 OCP， 在 同一 自主 音 段 音 层 的 相 邻 位 置 上 禁止 出 
现 相 同 的 音 系 成 分 。OCP 的 主要 作用 在 于 ,一 方面 它 突出 了 底层 表达 的 结构 特 
点 ， 使 底层 表达 清晰 明了 ， 另 一 方面 保证 了 连接 这 一 形式 手段 在 严格 遵守 连接 
规约 的 前 提 下 得 以 实施 。OCP 最 初 只 是 针对 声调 底层 表达 提出 的 ， 后 来 的 研究 
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把 OCP 应 用 范围 扩大 到 所 有 的 在 音 系 过 程 中 起 作用 的 自主 音 段 ， 使 OCP 不 仅 是 
对 表达 的 一 种 限制 ， 而 且 在 某 些 音 系 过 程 中 起 到 至 关 重 要 的 作用 ， 具 有 功能 性 
特点 。 

OCP 最 初 是 作为 一 个 绝对 不 能 违反 的 制约 条 件 提出 来 的 。 此 后 的 研究 
发 现 ， 某 些 语言 提供 了 充分 证 据说 明 底 层 表 达 可 以 违反 OCP。 因 此 ，OCP 并 
不 是 一 种 严格 的 具有 绝对 普遍 意义 的 原则 ， 而 是 一 种 “ 弱 ” 普 遍 限制 。 据 此 ， 
违反 OCP 的 表达 属于 有 标记 表达 ， 而 遵守 OCP 的 则 属于 无 标记 表达 (参见 
Goldsmith, 1990 ; Kenstowicz, 1994) 。 

从 优选 论 的 角度 看 ， 由 于 OCP 不 仅 既 作用 于 底层 表达 又 作用 于 表层 表达 ， 
而 且 作 为 “ 弱 ” 普 遍 限 制 的 可 违反 性 特点 符合 优选 论 所 理解 的 制约 条 件 所 具备 的 
基本 特征 ， 因 此 ，OCP 可 以 视 作 制约 条 件 。 对 此 ， 目 前 研究 者 的 意见 并 不 一 
致 。 (参见 McCarthy， 2002) 


第 十 章 TRAM ZH 
从 理论 上 讲 ， 所 有 的 音 系 特征 (包括 音 段 的 和 非 音 段 的 ) 可 以 出 现在 各 自 


的 自主 音 段 音 层 上 。 如 果 是 这 样 的 话 ， 其 结果 是 ， 主 干 音 层 除 了 为 每 个 自主 音 
段 音 层 提供 一 个 定位 以 外 ， 其 本 身 几乎 没有 任何 音 系 信 息 。 那 么 主干 究竟 包含 
何 种 音 系 信息 以 及 其 在 音 系 过 程 中 的 作用 便 成 为 非 线 性 理论 讨论 的 主要 问题 。 
本 章 从 几 个 不 同 的 角度 讨论 这 个 问题 。 

CV 音 系 学 把 主干 音 层 定义 为 位 于 音节 音 层 和 音 段 音 层 之 间 的 一 个 层次 ， 
叫做 CV 音 层 。CV 音 层 由 C 位 和 V 位 构成 。 从 线性 方面 看 ，C 位 和 V 位 是 时 量 单 
位 ， 它 们 决定 音 段 的 长 度 。 从 结构 方面 看 ，C 位 和 V 位 是 音节 的 直接 成 分 ， 同 
时 又 是 音 段 成 分 的 根 节 点 。CV 音 层 概 念 在 表达 复杂 音 段 、 处 理 音 段 脱落 、 增 
音 以 及 补偿 性 音 段 加 长 等 方面 的 问题 上 具有 一 定 的 解释 力 ， 在 音节 结构 分 析 方 
面 ， 通 过 CV 音 层 概念 ， 关 于 一 个 辅音 可 以 分 属 两 个 不 同音 节 的 语言 直觉 可 以 
得 到 形式 化 表述 。 ) 

在 非 串 联 语素 结构 中 ， 主 干 音 层 被 定义 为 由 C 位 和 V 位 构成 的 具有 形态 功 
能 的 模 架 ， 即 每 个 模 架 就 是 一 个 语素 。 模 架 中 的 C 位 和 V 位 通过 与 其 他 音 层 上 
音 系 成 分 的 连接 ， 共 同 构成 一 个 完成 的 词 。 

在 传统 音节 理论 中 ， 音 节 由 音节 首 音 和 齐 部 构成 ， 竟 部 又 由 音节 峰 和 音 
节 尾 音 构成 , 如 (a) 所 示 ; CV 音 系 学 抛弃 了 韵 部 概念 ， 音 节 直 接 由 C 位 和 V 位 构 
成 ， 同 一 结构 类 型 的 音节 如 (b) 所 示 : 
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然而 ， 关 于 由 C 位 和 V 位 直接 构成 音节 的 假设 难以 解释 某 些 事实 。 例如 ， 
补偿 性 音 段 加 长 总 是 在 韵 部 内 发 生 ， 重 音 位 置 往往 同 由 韵 素数 量 决定 的 音节 重 
量 有 关 ， 而 这 些 现象 与 音节 首 音 无 关 。 据 此 ， 位 于 音节 层面 和 音 段 层面 之 间 的 
应 当 是 韵 素 层 面 ， 而 CV 层 面 仅仅 是 音 段 的 根 节点 。 如 (c ) 所 示 : 


(c) o 


6 V C 


根据 这 一 理解 ， 主 干 音 层 实质 上 是 由 韵律 结构 的 基本 单位 前 素 构成 的 韵律 
音 层 。 
主干 音 层 具有 多 种 功能 是 自主 音 段 音 系 学 一 直 未 能 彻底 解决 的 问题 之 一 。 


第 十 一 章 ”特征 几何 
第 十 二 章 ”特征 树 研 究 

这 两 章 重点 讨论 特征 几何 和 不 充分 赋值 理论 。 

音 段 同化 往往 表现 出 两 个 特点 : (1) 根据 OCP， 与 同化 过 程 相关 的 音 
段 共 享 某 个 或 某 些 语音 特征 ， (2) 同化 过 程 涉 及 的 往往 是 一 组 语音 特征 。 在 
SPE 模 式 中 ， 音 段 是 一 个 没有 内 部 结构 的 特征 集 。 显 然 ， 处 于 无 结构 状态 的 特 
征集 难以 解释 某 些 同化 过 程 同时 涉及 若干 个 特征 的 现象 。 这 一 问题 的 提出 导 
致 产生 了 以 音 段 结构 为 研究 对 象 的 特征 几何 理论 (Clements, 1985; Sagey, 
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1986) 。 

特征 几何 在 形式 上 是 一 个 树 形 图 。 树 形 图 中 ， 语 音 特 征 通过 若干 个 属于 上 
位 节点 的 特征 群 的 方式 组 织 起 来 。 树 的 顶点 代表 整个 音 段 的 根 节点 ， 根 节点 制 
约 着 若干 个 高 层次 的 次 位 节点 ， 这 些 次 位 节点 又 分 别 制约 着 一 组 终端 成 分 。 从 
逻辑 的 角度 看 , 特征 几何 模型 的 必然 结果 是 所 有 的 特征 以 分 枝 树 形 的 方式 相互 
联系 着 , 每 一 个 特征 在 树 形 结构 中 都 有 各 自 的 位 置 。 这 种 方法 使 所 有 的 特征 结 
合 起 来 构成 一 个 完整 的 音 段 结构 体 。 

特征 几何 以 语音 产生 机 制 为 基础 。 但 是 ， 在 次 位 节点 和 终端 成 分 的 归属 等 
具体 问题 上 ， 研 究 者 的 意见 并 不 一 致 。 本 书 综合 多 家 的 观点 ， 重 点 讨论 了 部 位 节 
点 、 喉 部 机 制 节点 和 喉 上 机 制 节点 以 及 这 些 节点 同 终端 节点 上 语音 特点 之 间 的 关 
系 。 第 十 二 章 从 远 距离 同化 、 元 音 和 谐 以 及 复杂 音 段 等 方面 进一步 讨论 了 音 段 的 
内 部 结构 。 检 验 特征 几何 假设 的 主要 标准 是 次 位 节点 的 扩散 是 否 能 够 解释 音 系 过 
程 中 相关 音 段 共享 一 组 语音 特征 的 事实 ， 而且 这 种 解释 必须 具有 普遍 意义 。 

与 扩散 相关 的 是 底层 表达 的 形式 。 根 据 OCP， 在 同一 音 层 上 ， 在 相 邻 位 置 
上 不 能 出 现 相 同 的 音 系 成 分 。 在 这 个 问题 上 ， 本 书 介绍 了 底层 表达 极端 不 充分 
赋值 理论 (Archangeli, 1984) 。 根 据 这 一 理论 ， 在 能 够 区 别 对 比 性 音 段 或 语 
素 的 前 提 下 ， 底 层 表 达 中 音 系 特 征 的 数量 越 少 越 好 ( 即 底层 表达 特征 最 小 量 原 
则 ) ， 而 表层 表达 所 需要 的 语音 材料 则 由 缺 省 规则 提供 。 极 端 不 充分 赋值 理论 
的 目的 在 于 : (1) 简化 底层 表达 ， 突出 底层 表达 音 系 特点 ， (2) 解释 音 段 
的 非 对 称 性 。 需 要 说 明 的 是 ， 在 极端 不 充分 赋值 理论 里 ， 缺 省 规则 是 包括 标记 
性 规则 在 内 的 一 大 类 规则 。 也 就 是 说 ， 底 层 表 达 的 简化 和 音 段 非 对称 性 的 解释 
是 以 扩大 缺 省 规则 的 数量 和 应 用 范围 为 代价 的 。 极 端 不 充分 赋值 理论 的 主要 缺 
陷 有 二 : 一 是 扩大 缺 省 规则 应 用 范围 导致 音 段 关系 混乱 ， 二 是 底层 表达 的 极端 
简化 导致 对 表层 表达 形式 的 错误 预测 。 关 于 极端 不 充分 赋值 理论 更 为 详细 的 讨 
论 ， 读 者 可 以 参看 Roca (1994), 


第 十 三 章 ”重音 和 音 步 

本 章 主要 介绍 重音 和 音 步 两 个 基本 概念 以 及 线性 和 非 线性 理论 对 重音 分 布 
的 分 析 。 重 音 的 语音 表征 是 多 方面 的 。 例 如 ， 在 英语 里 重读 音节 除 在 气流 能 量 
较 大 和 听觉 上 响 度 较 高 以 外 ， 还 表现 为 音调 变 高 ， 元 音 长 度 增加 和 元 音 音 质变 
化 等 语音 表征 。 

决定 词 重音 分 布 的 因素 究竟 是 什么 ?这些 因素 和 重音 分 布 之 间 存 在 何 种 关 
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Al 这 是 本 章 讨论 的 主要 问题 。 本 章 重点 讨论 SPE 线 性 处 理 方 法 和 非 线性 的 节 
律 音 系 学 分 析 方 法 。 

重音 分 布 分 析 建 立 在 对 重音 性 质 认识 的 基础 上 。SPE 模 式 中 ， 重 音 被 看 成 
元 音 音 段 的 绝对 特征 ， 和 其 他 音 段 特征 同时 出 现在 元 音 的 特征 矩阵 里 。 重 音 特 
征 通过 一 系列 重音 规则 或 某 一 条 规则 的 重复 应 用 指派 给 某 个 元 音 。 对 SPE 模 式 
的 这 种 分 析 ， 批 评 者 认为 ， 首 先 ， 重 音 是 一 个 相对 概念 ， 而 且 重 音 分 布 同 音节 
结构 和 音节 重量 有 关 ， 因 此 把 重音 视 作 音 段 的 绝对 特征 是 错误 的 。 其 次 ， 重 
音 规 则 只 不 过 是 对 重音 分 布 现象 的 某 种 表述 ， 实 际 上 并 没有 对 重音 分 布 做 出 解 
释 ， 以 至 不 可 能 对 重音 分 布 做 出 预测 。 第 三 ， 基 于 重音 的 相对 性 特点 ， 与 重音 
有 关 的 事实 还 应 包括 次 重音 、 非 重读 和 弱化 音节 。 特 别 是 重音 的 交 蔡 ， 说 明 重 
音 模式 具有 侦 分 结构 的 特点 。 但 是 ，SPE 模 式 的 线性 分 析 没 有 涉及 这 个 问题 。 

在 节律 音 系 学 中 ， 重 音 被 视 作 把 节律 位 置 (音节 ) 组 织 成 一 个 具有 中 心 成 
分 结构 ( 音 步 ) 的 结果 。 音 步 是 一 个 基本 和 核心 概念 。 音 步 是 由 音节 构成 的 节 
律 结构 单位 ， 音 步 中 的 中 心 成 分 负载 重音 。 节 律 音 系 理论 主要 有 两 种 方法 ， 一 
种 是 节律 栅 分 析 法 (Halle & Vergnaud, 1987; Halle, 1990) ; 另 一 种 是 节 
律 树 分 析 法 (Hayes, 1981; Prince, 1990) 。 二 者 主要 差别 在 于 ， 前 者 认为 
重音 反映 了 节律 栅 上 中 心 成 分 的 标记 性 ， 而 后 者 认为 重音 是 不 同 重量 的 音 步 决 
定 的 节奏 现象 。 本 书 介绍 了 节律 树 分 析 法 。 

在 节律 树 分 析 法 中 ， 重 音 相 对 性 和 重音 交替 通过 节律 树 的 形式 得 以 表 


达 。 节 律 树 的 基本 形式 特点 是 : (1) 双向 分 枝 ， (2) 每 个 节点 有 一 个 支配 
作用 的 (dominant) 强 分 枝 (S) 和 一 个 被 支配 的 (recessive) 弱 分 枝 (W)。 节 
律 树 的 内 部 结构 同 四 项 参数 有 关 : (1) 支配 性 分 枝 在 左边 还 是 在 右边 (eft- 


dominant vs right-dominant); (2) 主要 重音 位 置 与 词 界 有 关 还 是 无 关 (bounded 
vs unbounded) ; (3) 在 与 词 界 有 关 的 重音 系统 里 ， 重 音 与 词 的 左边 界 有 关 还 
是 与 右边 界 有 关 (left-edge vs right-edge); (4) 重音 位 置 是 否 同音 步 内 部 结构 

音节 重量 ) 有 关 (quantity-sensitive vs quantity-insensitive)。 如 果 不 考虑 方向 参 
数 ( 即 词 的 左边 界 或 右边 界 ) ， 与 词 界 有 关 的 重音 系统 的 节律 树 可 以 分 成 四 种 
可 能 的 基本 结构 : QI-ld，QI-rd， QS-ld， QS-rd。 节律 音 系 学 的 主要 特点 在 于 
通过 形式 手段 表达 重音 相对 性 特点 ， 成 为 重音 分 布 分 析 的 原则 。. 


第 十 四 章 ” 对 重音 的 进一步 限制 
本 章 介绍 节律 音 系 学 重音 分 布 理 论 的 发 展 (Hayes, 1995) 。 在 感知 的 基 
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础 上 ， 话 语 的 节奏 分 为 扬 抑 格律 和 抑 扬 格 律 两 种 类 型 ， 加 之 音节 重量 参数 ， 共 
有 三 种 类 型 的 音 步 。 此 外 ， 在 音 步 结构 的 形式 表达 方面 ， 加 括 栅 取代 了 树 形 
图 。 基 本 分 析 程序 是 : 首先 确定 语言 的 音 步 结构 类 型 ， 然 后 通过 最 终 规则 推导 
出 音 步 结 构 的 显 突 位 置 ， 以 此 确定 词 的 主要 重音 和 其 他 重音 。 作 者 认为 ， 这 种 
新 的 分 析 方法 建立 在 语音 学 基础 之 上 ， 使 音 步 结构 的 形式 表达 体现 了 其 语音 功 
能 。 然 而 ， 训 减 音 步 和 节律 外 成 分 现象 仍然 是 节律 形式 描写 理论 需要 进一步 研 
究 的 问题 。 

音 步 结构 类 型 不 仅 是 影响 重音 分 布 的 主要 参数 之 一 ， 而 且 对 增 音 和 音 段 脱 
落 也 起 到 一 定 的 作用 。 某 些 音 段 规则 的 应 用 必须 以 音 步 类 型 为 基础 。 

构 词 过 程 可 能 影响 重音 分 布 。 在 某 些 语 言 里 ， 虽 然 单 语素 词 和 派生 词 的 音 
步 结构 相同 ， 但 二 者 的 重音 分 布 模式 却 不 一 样 ， 进 一 步 看 ， 有 些 构 词 过 程 影响 
重音 分 布 ， 而 另外 一 些 构 词 过 程 则 对 重音 分 布 没有 任何 影响 。 对 此 ， 节 律 音 系 
学 把 构 词 过 程 划分 成 两 类 并 分 别 置 于 循环 性 模块 和 非 循环 性 模块 内 。 在 循环 性 
模块 中 ， 在 第 一 个 构 词 过程 完 成 后 ， 重 音 消 抹 规 约 消除 原 有 的 韵律 结构 ， 然 后 
根据 重音 规则 重新 建立 韵律 结构 ， 如 此 反复 直至 最 后 一 个 构 词 过 程 完成 。 不 改 
变 一 般 重音 模式 的 构 词 过 程 是 在 循环 性 模块 内 完成 的 。 在 非 循环 性 模块 里 ， 重 
音 消 抹 规 约 没有 作用 ， 因 此 原 有 的 韵律 结构 保持 不 变 ， 重 音 规则 只 把 游离 成 分 
纳入 韵律 结构 之 中 。 改 变 一 般 重音 模式 的 构 词 过 程 是 在 非 循环 性 模块 内 完成 。 

本 章 最 后 从 优选 论 的 角度 分 析 了 重音 分 布 。 在 优选 论 的 分 析 框 架 内 ， 重 
音 分 布 是 两 大 类 制约 条 件 相 互 作用 的 结果 。 一 类 是 作用 于 音 步 结构 的 制约 条 
件 ， 另 一 类 是 同 界 制约 条 件 。 同 界 制约 条 件 的 基本 思想 是 : 语法 范畴 (EBA 
括 词根 、 词 干 、 词 级 和 语素 词 等 语素 范畴 ) 必须 和 韵律 范畴 (主要 包括 韵 素 、 
音节 、 音 步 、 音 系 词 等 ) 享有 共同 的 ( 左 或 右 ) 边界 。 同 界 制约 条 件 是 本 章 讨 
论 的 重点 之 一 。 同 界 制约 条 件 理论 是 优选 论 中 比较 成 熟 的 理论 ， 读 者 可 以 参考 
McCarthy (1993), 


第 十 五 章 ” 比 音 系 词 更 大 的 音 系 结构 单位 

本 章 介 绍 前 律 音 系 学 的 基本 理论 。 韵 律 音 系 学 的 基本 假设 是 : 句子 的 前 
律 结构 相对 地 独立 于 句子 的 句法 结构 , 韵律 结构 和 句法 结构 之 间 的 对 应 是 间接 
的 。 最 大 的 韵律 单位 是 音 系 句 段 ， 音 系 句 段 由 句 调 短语 构成 ， 句 调 短语 由 音 系 
短语 构成 ， 音 系 短语 又 由 音 系 词 构成 。 不 同 的 韵律 结构 单位 处 于 不 同 的 结构 层 
次 上 。 本 书 重点 介绍 了 严格 层次 假设 。 根 据 严 格 层 次 假设 ， 每 一 个 较 低层 次 上 
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的 韵律 结构 单位 必须 受到 较 高 层次 上 彰 律 结构 单位 的 辖 制 。 不 同 层 次 上 的 韵律 
结构 单位 界定 了 特定 音 系 过 程 的 域 。 确 定 韵律 结构 单位 通常 需要 考虑 三 个 方面 
WAR: (1) 形态 一 句法 结构 ， (2) 句法 短语 的 长 度 ， (3) 句子 的 信息 结 
构 。 不 同 层 次 上 的 韵律 结构 单位 具有 特定 的 语音 、 音 系 或 句法 表征 。 例 如 ， 较 
低层 次 上 的 韵律 结构 单位 通常 表现 出 较 强 的 边界 力度 ， 再 如 ， 音 系 短 语 和 和 句 调 
短语 通常 具有 边界 调 ， 又 如 ， 在 后 词 库 模 块 里 ， 韵 律 结构 单位 通常 与 音 段 的 分 
布 有 关 ， 而 且 界 定 音 系 过 程 的 域 。 此 外 ， 韵 律 结构 单位 有 可 能 对 句法 成 分 的 分 
布 产 生 一 定 的 作用 。 本 章 的 重点 是 界定 和 识别 不 同 层次 上 韵律 结构 单位 以 及 通 
过 韵律 结构 对 相关 的 现象 作出 解释 。 
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Preface 


There are about 6000 languages in the world today. Almost certainly, no two of them 
have the same sound structure: they vary widely in the number of consonants and vow- 
els they have, in their use of tonal contrasts, in their stress patterns, in the shape of their 
syllables, and so on. At the same time, all these languages show striking similarities in 
the way they structure their sound systems. Phonology is a thriving field of linguistic 
research that tries to understand the structure behind these systems. How do these 
similarities arise? Or again, why is there so much variation? How is our knowledge of 
the pronunciation of our language represented in our brain? How can we describe the 
pronunciation of a language? What do people do when they play language games? Why 
do loanwords often sound so totally different from the way they are pronounced in the 
donor language? These and many other questions are dealt with in this book. In our dis- 
cussion, we have tried to sketch the development of scientific thinking about the sound 
structure of languages and to take an unbiased view of the cognitive or physiological 
nature of the explanations. We hope we have succeeded in this task in at least some 
places in the book, and have got close enough to this ideal for it to serve as a reliable and 
relevant introduction to an important and exciting field. 

An understanding of phonological theory is impossible without at least some 
knowledge of the way speech is produced. Chapter 1 gives the basics of phonetics. It is 
a background introductory chapter and so, unlike the other chapters, does not contain 
exercises. 

The exercises are numbered consecutively through the book. A key is provided at 
the back. 

In the interest of our students, we have not followed the otherwise commendable 
practice of reporting data from the literature in their original transcriptions, but have 
used International Phonetic Alphabet (IPA) transcriptions (see p. 16) throughout. 
We hope that not too much was lost in these translations. 

This book reflects the thoughts and discoveries of many phonologists. We have 
learnt to appreciate the value and implications of these theoretical positions not only 
by reading their publications, but also by attending their classes and discussing the 
issues with them. Needless to say, our debt to them is inestimable. However, we have 
not just attempted to present the material in a coherent and accessible fashion, but 
have at times included original data and analyses. 
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PREFACE TO SECOND EDITION 


In this second edition, we have made various corrections and revised a number of 
passages, among which the discussion on rule ordering, which has been enriched with 
a passage on opacity, the feature analysis of affricates in chapters 5 and 11, and 
Chapter 15. We have added a number of new exercises, numbered the exercises con- 
secutively throughout the book, and provided a key for all of them. The second edition 
has benefited from the comments made by Elan Dresher, San Duanmu, Ed Flemming, 
Ingmar Steiner, Bo Hagström, Victoria Rosén, Rik van Gijn, Jeroen van de Weijer, Leo 
Wetzels, Maria Wolters, and those who responded to the publisher’s questionnaire. 
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The production of speech 


1.1_INTRODUCTION 


This preliminary chapterlooks briefly into the question ofhow we produce speech.An 
understanding of the speech production process helps us to see the relations between 
the speech sounds occurring in human languages, and enables us to categorize them 
on the basis of their articulation. It also allows us to understand the adaptations they 
undergo when they occur in combination with other speech sounds. We will describe 
the speech production process in two stages. First, we consider the role of the lungs 
and the larynx. Second, we deal with the role of the channel extending from the larynx 
onwards, called the vocal tract, which is formed by the pharynx, the mouth and - for 
some sounds ~ the nasal cavity. The term organs of speech is used to refer to parts of 
the body in the larynx and the vocal tract that are involved in the production of 
speech. It is a misleading term in that it suggests that we have special physical organs 
for speaking. This is not so: all our so-called ‘organs of speech’ have primary bio- 
logical functions relating to our respiratory system and the processing of food. This 
chapter contains a highly simplified account of the speech production process. For 
a more complete introduction, see Ladefoged (1993), Catford (1988), Laver (1994) or 
Ladefoged and Maddieson (1996). 

The pronunciation of words is conventionally represented with the help of phonetic 
symbols, any such representation being a phonetic transcription. The symbols used 
in this book are those proposed by the International Phonetic Association (IPA), 
which can be found on p. 16. A phonetic symbol stands for a particular speech sound, 
or segment, which is defined independently of any language. Phonetic symbols may 
be accompanied by diacritics, signs which are printed above or below a phonetic 
symbol or with which the symbol is superscripted, and which specify particular fea- 
tures of pronunciation. For example, in the transcription [k"æt], which indicates the 
pronunciation of the English word cat, [k"] represents a [k] which is accompanied 
by aspiration, a brief [h]-like sound occurring between the [k] proper and the 
following [æ]. It is not always necessary, or even desirable, to indicate all the features 
of the pronunciation of a word in a transcription: the transcription [ket] is often 
sufficiently informative if the reader knows the language concerned. A transcription 
that includes a great deal of detail is called narrow. . 


2 The production of speech 


1.2 THE LUNGS AND THE LARYNX 


A crucial requirement for the production of speech is a mechanism to create an air 
pressure difference in the appropriate locations in the larynx and vocal tract. This air 
pressure difference is used for three purposes. First, it can be used to drive the vibratory , 
opening and closing actions of organs like the vocal cords, as explained in section 1.2.1 
below. During the articulation of [m], for instance, vocal-cord vibration can easily be 
felt by placing one’s fingertips on one’s ‘Adam’s apple’. Second, it can be used to generate 
a flow of air that can be channelled through a narrow opening to create audible air tur- 
bulence, or friction. This happens during the articulation of [s], for instance. Third, it 
can be used to build up pressure behind a complete blockage of the vocal tract in order 
to create an explosive ‘popping’ sound when the blockage is suddenly removed. This 
occurs in the articulation of plosive consonants like [p], such as when we say [pa]. 

By far the greater majority of speech sounds are produced with the help of increased 
air pressure created by our lungs. Before we begin to speak, we breathe in, taking in 
sufficient air to produce an utterance of reasonable length. Instead of simply letting go 
of the muscular tension and allowing our lungs to collapse, pushing the air from them 
(which is what we would do if we were breathing normally), we slowly ease up on the 
tension, thereby slowing down the exhalation phase. This artificially extended period 
of pressure from our lungs is used to produce speech. Because the air pressure differ- 
ence is generated with the help of the lungs, speech sounds produced in this way are 
called pulmonic, and because it is the exhalation phase rather than the inhalation 
phase that is used, these speech sounds are called egressive. Most languages, includ- 
ing German, French and English, only have pulmonic egressive sounds. In section 1.8, 
we will briefly describe the production mechanisms of three types of nonpulmonic 
sounds (clicks, implosives and ejectives). 

After passing through the bronchi and the trachea, the first organ the airstream will 
meet on its path from the lungs is the larynx. The outward part of this organ can be 
felt ~ and, especially in men, be seen - at the front of the neck (the Adam’s apple). The 
larynx is a valve, which can be opened and closed by two thickish flaps that run from 
back to front inside the larynx (see Fig. 1.1). These flaps are primarily there to prevent 
food or saliva from entering the lungs, but because they also have a function in speech 
they are known as the vocal folds or the vocal cords.' The aperture between them is 
called the glottis. No air can pass through the glottis when it is closed, while the air 
can flow quite freely through an open glottis. 


1.2.1 The vocal folds: the open and vibrating glottis 


There are many consonants that are produced with the glottis held open, as in ordin- 
ary breathing. Such sounds are called voiceless, and we hear them because other 


| There is an additional valve, called the epiglottis, positioned above the larynx where the root of the tongue 
begins. It normally points upwards, but it flaps down to channel food and saliva into the esophagus — the tube 
behind the larynx leading to the stomach - when we swallow. 
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(b) 


Figure 1.1 
Schematic drawings of (a) a closed glottis, as during the closure stage of a glotttal stop or during the 
closed phase of the vibrating glottis; (b) an open glottis; (c) a narrowed glottis. 


speech organs, the tongue or the lips, are used to generate sound further up in the 
vocal tract. Examples of voiceless sounds are [f] and [f] in fish and [st] in stay. 

The vocal cords are exploited in various ways to create sound which can be used as 
a basis for speech. This is known as phonation. The most important type of phon- 
ation is voice, which is produced when the vocal folds vibrate. Vocal-fold vibration 
occurs when the closed glottis is subjected to increased subglottal air pressure which 
is sufficient to blow the vocal folds apart, but not enough to prevent them from falling 
together again when the air pressure between them drops as a result of the Bernoulli 
effect, a physical effect which causes pressure mimima at points where the flow of 
gases or liquids is high. As soon as they have been sucked together, the vocal folds are 
once more blown apart as a result of the subglottal air pressure. This process typically 
repeats itself more than 100 times per second for the larger and laxer vocal folds of 
men, and over 200 times per second for the smaller vocal folds of women. Consonants 
like [m], [I] and [j] (which we will see are sonorant consonants) and vowels are nor- 
mally voiced. Voiced obstruent consonants also exist, such as [b] in English abbey and 
[z] in lazy, and usually contrast with their voiceless counterparts (in these cases with 
[p] and [s], as in happy, lacy). 


1.2.2 Devoicing and aspiration 


When a consonant that is normally voiced is pronounced without vocal-fold vibration 
in some context, it is said to be devoiced. Devoiced segments are symbolized with a 
circle below the symbol (which may also appear above it, if there is no space below it). 
For example, devoicing may follow voiceless obstruents, in particular plosives. In 
many languages, the vocal folds may begin to vibrate immediately after the release of 
the closure made for the plosive for a following voiced segment, as in [pa], but in other 
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languages the vocal folds may remain open for a while. In the latter case, the plosives 
are said to be aspirated. As shown above, aspirated plosives are symbolized with a 
superscript [h], as in [ph a]. English has voiceless aspirated plosives at the beginning 
of the syllable, as in tea, pea, key. If a sonorant consonant rather than a vowel follows 
the aspirated plosive, the aspiration is indicated by means of the devoicing diacritic, as 
in English [plei] play. 

The timing relation between plosive releases and the onset of vocal-fold vibration is 
expressed as Voice Onset Time, or VOT. The VOT is zero when the plosive release and the 
onset of vocal-fold vibration are simultaneous; when the onset of vocal-fold vibration is 
earlier than the release of the plosive, VOT is negative, and when it is later, it is positive. 
Typical values for aspirated plosives are between VOT +50ms and VOT +80ms. 


1.2.3 Special types of phonation 


Three special types of phonation are mentioned here, whisper, breathy voice and 
creaky voice. 


1. The vocal folds can be brought together to form a narrowing which produces 
friction when air passes through it. This is how people whisper: instead of voice, 
glottal friction may be used as the acoustic source to be modified. A whispered 
speech sound occurring in otherwise voice-phonated speech is [h], as in 
English [het] hat. Whisper can be indicated by the devoicing symbol, as has 
been done in [gæt]. 


2. Breathy voice occurs when the closing phase of the vibration is not complete, 
so that air is allowed to flow through with friction during phonation. Breathy 
voice is used in European languages to signal confidentiality (Laver 1994: 200), 
and is sometimes jocularly used to create the effect of a sexy voice. Voiced aspi- 
rated plosives, which occur in many languages spoken on the Indian subconti- 
nent, are produced by allowing breathy voice to be used throughout the plosive 
and the following vowel, as in Hindi [bfal] ‘forehead’. Breathy voice can be indi- 
cated by [..]. 


3. Creaky voice or laryngealized voice is produced with tight vocal folds, and 
often allows the listener to hear the opening actions of the vocal folds as sep- 
arate events. (The effect may remind one of the sound produced when running 
a fingernail across the teeth of a comb.) British English speakers may break into 
creak at the ends of their utterances, when the pitch is low. Many Nilotic lan- 
guages use laryngeal voice contrastively (Ladefoged 1971). Dinka has a set of 
vowels with creaky voice, symbolized by means of [_], which contrast with a set 
with breathy voice, as in [rd:r] ‘forest’ - [rd:r] ‘men’ (Andersen 1987; long seg- 
ments are followed by [:]). 


1.2.4 Pitch 


The number of times the vocal folds open and close per second determines 
the frequency of vibration, or the fundamental frequency, and is expressed in 
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Hz (short for hertz). Variations in the frequency of vibration are heard by the listener 
as variations of pitch: the more frequently the vocal folds open and close, the higher 
the pitch. In languages like English, French, German and Spanish, variations in pitch 
are used to signal different discoursal meanings. For instance, in the utterance But I 
don't want it!, the syllable want will be higher than the syllable it, but in Want it?, it will 
be the higher syllable. These two intonation patterns are known as the ‘fall’ and the 
‘tise’, and add a ‘declarative’ and ‘interrogative meaning to the utterance, respectively. 
In other languages, called tone languages, different pitch patterns are used in the 
same way vowels and consonants are used in all languages, i.e. to distinguish words 
from one another. Dinka has low tone [`], high tone ['] and falling tone [*], as illus- 
trated by [row] ‘two’ vs [row] ‘thirst’ and [yd:w] ‘bones’ vs [yé:m] ‘wind’ (Andersen 
1987). Karen has three level tones, high, mid ["] and low, as illustrated by [t3] ‘one’, [t3] 
‘spoon’ and [t3] ‘ant’ (Jones 1961). 


1.2.5 The glottal stop 


It is possible during phonation to suddenly close the glottis, hold that closure briefly 
and then, equally suddenly, allow the vocal folds to vibrate again. The resulting speech 
sound is known as the glottal stop, symbolized by [?]. English has the paralinguistic 
item [?m?m], with the first [m] higher in pitch than the second, meaning ‘no’, used in 
informal settings. In German, [?] occurs before the second syllable of Beamter 
([bə'?amtəg]). A glottal stop requires voicing on one side only; hence it can be initial, 
as in Hawaiian [?u] ‘to moan and final, as in Tsou [su?] ‘to fall’. 

In summary, the larynx produces either no sound (open glottis) or a phonated 
sound. In the former case, the segment is said to be voiceless, and the acoustic energy 
created for the speech sound lies at a location above the larynx. In the other case, the 
phonation could be normal voice, breathy voice or creaky voice, in which cases the 
speech sound is voiced, or whisper. Finally, a brief closure before or after voicing rep- 
resents a glottal stop. In the next section, we consider the ways in which an acoustic 
source can be created at a location above the larynx, as well as the ways in which the 
phonated laryngeal sound source can be modified by differences in the shape of the 
supralaryngeal cavities, which together form the vocal tract. 


1.3 THE VOCAL TRACT 


The channel from the larynx onwards is called the vocal tract, which extends all the 
way to the lips (see Fig. 1.2). It consists of the pharynx and the mouth, to which an 
extra tube extending to the nostrils may be coupled, the nasal cavity. This (poten- 
tially bifurcating) tube acts as a resonator, modifying the sound produced at the glot- 
tis. Most spectacularly, the sound produced by a vibrating glottis can be modified by 
changing the position of the tongue or jaw so as to produce a range of different vowel 
qualities. Second, inside the vocal tract there are further opportunities for generating 
sound, which - in their turn - will be modified by the shape of the vocal tract in front 
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Figure 1.2 
Cross-section of the vocal tract. 


of the sound source. For instance, the tongue can be brought up against the roof of the 
mouth to form a constriction that generates friction when air is passed through it. 
In this section, we describe the three parts of the vocal tract mentioned above, and in 
section 1.4 we deal with the positions that the tongue, jaw and lips may assume to pro- 
duce different vowels. In section 1.5 we identify the various places at which languages 
make articulatory constrictions. In the same section, we will classify the types of 
constriction that are used. 
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1.3.1 The pharynx 


The pharynx is the vertical part of the tube extending up from the larynx to the velum. 
The forward wall is formed by the root of the tongue, which faces the back wall of the 
pharynx. 


1.3.2 The nasal cavity 


The soft palate, or velum, is a valve which closes off the entrance to the nasal cavity 
when it is pressed up, but opens the cavity when it is allowed to hang down, as in ordin- 
ary breathing (When we have a cold, the entrance to the nasal cavity may be blocked 
by mucus, which forces us to breathe through the mouth, and hence to produce only 
oral sounds). The velum ends in a pear-shaped little blob of flesh, which can be seen 
during the speaker’s articulation of [a:]. It is called the uvula. 


1.3.3 The mouth 


The mouth is the most important part of the vocal tract because it is here that the 
most drastic modifications of its shape are achieved and the majority of the articula- 
tory contacts are made. The roof of the mouth is formed by the soft palate, with the 
uvula at the extreme end, and the hard palate, which lies to the front of the soft palate 
(also known simply as the palate). With a curled-back tongue, it is possible to feel the 
hard palate arching back to where the soft palate begins. Immediately behind the front 
teeth is the alveolar ridge, which is touched by the tongue during the pronunciation 
of dada; then there are the front teeth themselves and the upper lip. Below these 
parts there are the more active speech organs: the lower lip and the tongue. The zone 
immediately behind the tip of the tongue is called the blade. You use it when imitat- 
ing the sharp, hissing sound of a snake. Together, tip and blade are called the crown, 
which term is from Clements (1989). The part of the tongue opposite the hard palate 
is called the front, the part opposite the soft palate is called the back. The section 
comprising both front and back is known as the dorsum. 


1.4 VOWELS 


For vowels, the vocal folds vibrate. The crown is held behind the lower teeth, while the 
dorsum is bunched, forming a constriction that allows a frictionless escape of the air. 
The lips may be rounded, causing the vocal tract to be lengthened. The location of the 
bunch can be varied in a vertical dimension, as well as in a horizontal dimension. In 
the British English phonetic tradition, four steps are recognized in the vertical dimen- 
sion: close, close-mid, open-mid and open. This can be seen in the top right-hand 
corner of the IPA chart on p. 16. In the American English tradition, the three terms 
high, mid and low are used to distinguish three vowel heights. Horizontally, the 
bunch can go from front to back, with central being used as an in-between value. If 
we disregard the open vowel [a], most languages just have front unrounded and back 
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rounded vowels. Italian, for instance, has the vowels shown in (1). The position of the 
lips is rounded for [u], [0] and [9], and unrounded for [i], [e], [e] and [a]. (Deviating 
from IPA conventions, authors normally use the symbol [a] to represent a central or 
central to back open unrounded vowel like [a].) 


Back unrounded Back rounded 


If the tongue positions used for [i], [e] and [£] are combined with rounded lips, 
the vowels [y], as in French [lyn] ‘moon, [ø], as in [po] ‘(a) little’ and [ce], as in 
[scel] ‘alone’ are produced. A rounded [a] is [0], and may occur in dog as pronounced 
in varieties of English spoken in England and on the east coast of the USA. Unrounded 
vowels with the bunch in the back or centre exist also. The unrounded counterpart of 
[ə] is [A], that of [o] is [x], and that of [u] is [w]. 

Rounded front vowels are somewhat more central than unrounded ones, while 
unrounded back vowels are somewhat more central than rounded back vowels. 
Frequently, of the low vowels only the back member is ever rounded, and many lan- 
guages avoid low round vowels altogether. The vowels of Korean, given in (2), illustrate 
the avoidance of low round vowels. (The vowel [æ] has merged with [e] for many 
speakers.) Example words are given in (3). 


Front Centralized Centralized 
unrounded front back 
rounded unrounded 


‘mountain’ 
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While a central position of the tongue bunch is thus quite common, central vowels pat- 
tern like back vowels if they are unrounded, and like front vowels if they are rounded. 

Diphthongs. A vowel whose quality remains stable during its production, like [æ] 
in English hat or [u:] in German [gu:t] ‘good’, is known as a monophthong. When 
two different vowels appear in the same syllable, the combination is known as a diph- 
thong. English has diphthongs in [lat] lie, [nau] now and [d391] joy, while German 
has the diphthongs [ai], [au] and [oi] in [tsait] ‘time’, [haus] ‘house’ and [biç] ‘you 
(obj. pl.)’, respectively. 

Nasalization. Vowels can be nasalized: if during their production the soft palate 
is lowered (so that the nasal cavity is opened up), a nasal quality is added to them. 
Nasalized vowels occur in Portuguese and French, and are symbolized by placing a 
tilde over the symbol. Thus, French has three nasalized vowels, occurring in [vê] 
‘wine’, [tat ] ‘aunt’ and [5t] ‘shame’. 

Rather than reproducing all the symbols here, we give the symbol chart of the 
International Phonetic Association (IPA) on p. 16. (Not all of these segments will fig- 
ure in this book.) 


1.5 CONSTRICTIONS 


In the vocal tract various types of constriction can be made in different places. We will 
first go through the different places, and then discuss the different kinds of constric- 
tion. The expression ‘to articulate with X’ is used to mean ‘to form a constriction at X’. 


1.5.1 Places of articulation 
Pharyngeal 


The root of the tongue articulates with the back wall of the pharynx. Gulf Arabic has 
a voiceless pharyngeal fricative, as in [laham] ‘meat’. 


Dorsal 


The dorsum articulates with the roof of the mouth. If it is the back that articulates 
with the soft palate, the term velar is used. This place is used for [k] in French quand 
‘when’, for [p] in English hang, for [g] in English good and for [x] in Scottish English 
loch. If it is the front which articulates with the hard palate, the term palatal is used. It 
is found in German nicht, where [¢] occurs, and in French qui, where a voiceless plo- 
sive [k] occurs. Such consonants are really ‘fronted velars’, and should be distin- 
guished from palatals or palatoalveolars that are produced somewhat further forward, 
like the [c] in Dutch [kacjs] ‘cat+pi’, or English [f] in ship. These latter segments 
also involve a raising of the crown, and are therefore treated under ‘Coronal’ below. 
Retraction of the dorsum allows the back of the tongue to articulate with the uvula. A 
voiceless uvular fricative [X] occurs in Western Dutch, as in [xe:l] ‘yellow’. The uvular 
stops [q,¢] occur in Tlingit, for instance. 
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Coronal 


The crown may articulate with the teeth, the alveolar ridge or the forward part of the 
hard palate immediately behind the alveolar ridge. When the crown articulates with 
the upper teeth, dental segments are produced, like [0] in English thing and [ð] in this 
and that; the dental plosives [t,d] occur for instance in Sinhalese [tada] ‘hard’. The 
label alveolar is used if the crown articulates with the alveolar ridge. Examples are [t] 
and [d] in German [t"u:n] ‘do’, [du:] ‘you’, [n] in [naxt] ‘night or [I] as in ['alas] ‘every- 
thing’. If the crown articulates with the rear edge of the alveolar ridge, a postalveolar 
consonant is produced. English [f] in shore, [3] in measure, [tf] in chip and [d3] in jet 
are articulated with the crown of the tongue, while the front of the tongue is raised 
towards the hard palate. Often, as in Dutch, the tip is held behind the lower teeth for 
this type of consonant, in which case the contact is alternatively labelled prepalatal. A 
postalveolar articulation with just the tip of the tongue occurs in English [t1] as in try. 
If this type of contact is made with the lower surface of the tongue blade, keeping the 
tongue tip curled back, the term retroflex is used. Like many languages spoken in 
India, Hindi has retroflex consonants such as [t,dn] as in [trandi] ‘cold’ and [gfanta] 
‘hour’. 


Labial 


If the lips articulate with each other, as in English [p, b, m] in spot, bell, mad, the place 
is bilabial. If the lower lip articulates with the upper teeth, the place is labiodental. It 
is used for [f] in German [fi:] ‘cattle or French [fer] ‘do’. 


1.5.2 Types of constriction 
A first subdivision distinguishes between two kinds of constriction. 


1. A constriction that is tight enough to lead to friction when a (voiced or voice- 
less) airstream is passed through it, as used with obstruents. 


2. A type of constriction that is wide enough for a voiced airstream to pass 
through without friction, which is used for sonorants. 


In the case of obstruents, an acoustic source is actually created at the point of articula- 
tion: either a popping sound is produced (for plosives) or friction is produced at that 
spot (for fricatives and affricated plosives). The auditory quality of sonorants relies 
exclusively on the different shapes the vocal tract is given, i.e. on the resulting modifica- 
tions of the acoustic characteristics of the sound produced by phonation in the larynx. 
Obstruents are subdivided into plosives (also called stops), fricatives and affricates. 


Plosives 


These are formed by creating a complete closure at some point in the speech tract, 
behind which the air from the lungs is compressed until the closure is abruptly released 
so that the air explodes outwards. Since the soft palate is raised, the air cannot escape 
through the nasal cavity. Examples are French voiceless [p,t,k], as in [pip] ‘pipe’, [tip] 
‘type’, [e'kip] ‘crew’. Voiced plosives occur in French [bi'de] ‘bidet’ and [ga] ‘glove’. 
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Plosives have a very brief friction burst when they are released, which is not usually 
heard as friction, but is responsible for the ‘popping’ quality of plosive releases. 


Fricatives 


These are formed by narrowing the speech tract to such a degree that audible fric- 
tion is produced when air passes through. English has the voiceless labiodental frica- 
tive [f] in fee, the voiceless dental fricative [0] in thigh, the voiceless alveolar [s] in sigh 
and the voiceless palatoalveolar [f] in shy. The voiced counterparts [v,0,Z,3] occur in 
vie, that, zoo and measure, respectively. (At the beginning of the syllable, English [v,ð] 
frequently are pronounced without friction.) 


Affricates 


These are dealt with in section 1.7.3. 


Sonorants 


These divide into nasals and approximants. 


Nasals 


For nasals the soft palate is lowered, and the oral cavity is blocked completely at some 
point. A slow, deliberate pronunciation of morning will allow us to observe how each 
of the three nasal consonants in the word has a different place of articulation: a bila- 
bial [m], an alveolar [n] and a dorsal [n]. (Pre-)palatal [p] occurs in Dutch, as in 
['spanja] ‘Spain’, and in French, as in [a'po] ‘lamb’. 


Approximants 


Approximants derive their name from the approximation of the articulators which 
gives rise to a light or near-contact, the airstream being so weak that no friction is 
produced. The (pre-)palatal approximant [j] occurs in English yes, while a bilabial one 
([B]) occurs in Southern Dutch [Bat] ‘what’. (When combined with a symbol for a 
voiced fricative, the subscript [ |] indicates an approximant, i.e. frictionless pronunci- 
ation.) Frequently, languages have rhotics, or [r]-type segments, that are approximant, 
such as the postalveolar approximant [1] in English ray. Also trills occur, during which 
the uvula is allowed to vibrate against the back of the tongue [R], as in European 
Portuguese [karu] ‘car’, or the tongue tip against the alveolar ridge [r], as in Spanish 
[pero] ‘dog’. When, instead of a series of vibratory taps, a single such brief contact is 
made, a flap is produced ([r]). In Spanish and Catalan the alveolar flap contrasts with a 
trill, a minimal pair in Catalan being [ para] ‘father’ - [para] ‘grapevine’. In Arawak, 
the alveolar flap contrasts with a retroflex flap, which involves flicking the curled back 
tongue forward, causing the tip to hit the rear edge of the alveolar ridge, as illustrated in 
[hòroro] ‘swampy’ - [hororol ‘cloud’ (Pet 1979), while Toda contrasts three places of 
articulation for trills (or flaps), postdental, alveolar and retroflex, as in [kar] ‘border of 
cloth’, [kar] ‘juice’ and [kar] ‘pen for calves’ (Spajić et al. 1996). 
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For [I],as in German [alas] ‘everything’, the airstream is partly blocked by the tongue 
tip contact with the alveolar ridge, but allowed to escape freely on one or both sides. 
Sounds which have this type of partial occlusion are called lateral. Because the air 
escapes without friction through the lateral opening, the German lateral is an approxi- 
mant. The voiceless lateral fricative ([4]) has a turbulent escape of air along the lateral 
opening(s); it occurs in Welsh, as in the place-name [tan‘didno] (landudno). Laterals are 
usually alveolar: the crown articulates with the alveolar ridge. A palatal (approximant) 
lateral [A] occurs in Italian ['zbaá:0] ‘mistake’ or Catalan [Aop] ‘wolf’. 


1.6 SEGMENTAL DURATION 


In many languages, duration contrasts exist in the group of vowels, as in Hawaiian, or 
in the group of consonants, as in Italian, or both, as in Finnish and Japanese. Long con- 
sonants are also called geminates. Italian and Japanese have long consonants in inter- 
vocalic position. An illustration of such a duration contrast is Italian [fato] ‘fate’ versus 
[fat:0] ‘fact’. Imonda has this type of contrast in initial position, as illustrated by 
[nzol] ‘uncle’ versus [nől] ‘seed’ (Seiler 1985). Frequently, only a subset of the conso- 
nants occurs both long and short. Thus, only [p,t,k,s] occur as long consonants in the 
native vocabulary of Japanese. Vocalic duration contrasts, such as exist in Hawaiian, 
are more common. As in the case of consonantal duration contrasts, sometimes only a 
subset of the vowels occurs both long and short. Thus, Chipewyan has five short vow- 
els ([i,e,a,0,u]), but only three of them have long variants ([i:,a:,u:]). Moreover, the 
quality of the long vowels may differ somewhat from that of the short vowels. Finnish, 
for instance, has eight monophthongal vowels appearing both short and long, but long 
[e:] has a closer quality than short [e]. 

In German, the durational contrast coincides with a difference in the position of the 
root of the tongue, which is advanced somewhat during the pronunciation of the long 
vowels. Such vowels are known as tense, while their unmodified counterparts are lax. 
As a result of the widening of the pharynx, the tongue body will tend to be higher for 
tense vowels than for lax vowels, at least for vowels that are not fully open. In (4), 
which gives the vowel system of Standard German, the symbol on the left in each box 
is a lax, short vowel, while the symbol on the right represents a long, tense vowel. 
Examples of words with these vowels can be found on p. 70. 


Front Centralized Centralized 
unrounded front back 


rounded unrounded 
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In more conservative German, there is also a long lax vowel [æ:] (bracketed in (4)), 
which has merged with [e:] in the speech of many speakers. In other West Germanic 
languages, too, there may be long lax vowels, like southern British English [æ:] in sad, 
or short tense vowels, like [i] in Dutch [fits] ‘bicycle’. 


1.7 COMPLEX CONSONANTS 


Complex consonants are single segments which in some way have two distinguishable 
articulations. We distinguish consonants with a secondary articulation, consonants 
with a double articulation, and manner-contour consonants. 


1.7.1 Secondary articulations 


The articulation of a consonant does not require the services of the entire tongue as well 
as the lips. For [t,d], for instance, only the crown is used, and for [fp,m] only the lips. In 
fact, the lips and tongue can be used to form a vocalic articulation simultaneously with the 
production of the consonant. The result is what is known as a consonant with secondary 
articulation. The following types of secondary articulation can be distinguished: 


1. Labialization. During the articulation of the consonant, the lips are rounded. 
A labialized velar plosive, for instance, is symbolized [kw]. 


2. Palatalization. The front of the tongue is raised (as for [i] or [j]) during the 


pronunciation of a consonant. A palatalized bilabial plosive, for instance, is 
symbolized [p]. 


3. Velarization. The back of the tongue is raised (as for [u] or [w]), during the pro- 
nunciation of a consonant. A velarized alveolar lateral approximant (‘dark P), 
symbolized [t], is used postvocalically in most varieties of English, as in all. 


4. Pharyngealization. The root of the tongue is retracted towards the back wall of 
the pharynx. Arabic ‘emphatic’ consonants are pharyngealized. This is indicated 
by placing a dot below the phonetic symbol concerned (e.g. [s]). 


1.7.2 Double articulations 


Some consonants have two consonant-type constrictions at the same time, such that 
it is not possible to say which is ‘primary’ (i.e. consonant-like) and which ‘secondary’ 
(vowel-like). Well-known examples are [kp] and [gb], labial-velar plosives, which 
occur in many Niger-Congo languages. The English consonant [w] is a labial-velar 
approximant: it is pronounced with a raising of the back, as well as with rounding of 
the lips. Such consonants are said to involve a double articulation. 


1.7.3 Manner-contour consonants 


Some consonants change their constriction-type half-way through. An example is 
provided by prenasalized stops, which occur for instance in Bantu languages. 
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Such consonants begin like nasals and end like plosives at the same place of articula- 
tion ({mb], [nd], [ng]). Similarly, an affricate is a combination of a plosive anda frica- 
tive at the same place of articulation. They are thus like plosives, but the release is so 
slow that friction is heard. German has labial and alveolar affricates in [tsait] ‘time’ 
and [pfaifa] ‘pipe’, English palatoalveolar [tf] in chap and [d3] in Jack. 


1.8 NONPULMONIC CONSONANTS 


We briefly describe the three classes of nonpulmonic segment here. 


1. Clicks. Clicks are produced with the help of a velar closure (as for [k]) plus a 
closure somewhere further forward. This forward closure may be located at the 
lips (rare) or at, or immediately behind, the alveolar ridge. The trapped air in the 
pocket in front of the velar closure is rarefied by lowering the body of the tongue, 
or — in the case of a bilabial click — of the jaw. When the forward closure is released 
to allow outside air to rush into the pocket of rarefied air, a clacking noise-burst 
results. Clicks may be contrastively accompanied by a glottal stop at the beginning 
of the postclick vowel, by nasalization on the following vowel, by aspiration pre- 
ceding the following vowel, or by a slow (affricated) release of the velar closure. 
Five articulation places for the forward contact occur. Their symbols are included 
under ‘Other symbols’ in the IPA chart. The alveolar click is used paralinguisti- 
cally in English as a sign of disapproval (rendered tut, tut in British English and 
tsk, tsk in American English), while repeated lateral alveolar clicks are sometimes 
used to imitate the noise a horse's hooves make on the pavement. Clicks only 
occur in languages spoken in southern Africa, !XU, Nama and Xhosa being well- 
known examples of such languages. 


2. Implosives. Implosives have a closure as for plosives, as well as a closed glottis. 
By lowering the larynx, the air in the mouth and pharyx is rarefied, so that a 
noise-burst will result when the oral closure is released. Because on its way 
down, the speaker relaxes his glottal closure, phonation will occur as the glottis 
meets the air in the trachea. Such speech segments occur in Vietnamese, as well 
as in some Niger-Congo languages. Their symbols are [6,d,g]. 


3. Ejectives. The initial articulatory configuration of ejectives is like that used for 
implosives, but instead of being pulled down, the larynx is pushed up. The air 
inside the pharynx and mouth is compressed, so that upon the release of the 
oral closure, an egressive noise-burst occurs. Ejectives are reasonably common, 
and are symbolized [p’,t’,k’]. 


1.9 STRESS 


Words consist of rhythmic units called feet, the most common foot type being disyl- 
labic. One of the syllables of the foot is more prominent or stronger than the other 
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syllable(s) in it, which for this reason is called the stressed syllable. In city, this is the 
first syllable. There are two types of foot, and languages either have the one or the 
other. In left-dominant feet, the first syllable is the stressed syllable and the others are 
unstressed. In right-dominant feet, the rightmost syllable is the stressed syllable. 
English has left-dominant feet. A word like celebration contains the two feet [sela] 
and [bre1fn]. One of the feet in a word has the primary stress or word stress. In cele- 
bration this is the last foot. The other feet have secondary stress. The IPA notation 
for primary stress is ['], and for secondary stress is [,], to be placed before the sylla- 
bles concerned. In most publications on this topic, ['] is used over the vowel for pri- 
mary stress, and ['] for secondary stress. The English word ,cele'bration (or 
célebrdtion) contrasts with 'alli gator (or dlligator) in the location of the primary 
stress: while both alligator and celebration consist of two feet, the first foot has pri- 
mary stress in alligator, but the second in celebration. In addition to disyllabic feet, 
there may also be monosyllabic feet, as in cat, and ternary feet, as in origin ['orad3on]. 
Examples of English disyllabic words that consist of two monosyllabic feet are 
['sin,taeks] syntax and [,ken'ti:n] canteén. Thus, within the foot, a distinction 
between stressed and unstressed syllables is made; among the different feet in a word, 
a distinction between primary stress and secondary stress is made. Notice that ['] and 
[`] are also used to indicate tone (cf. section 1.2.4). 


1.10 CONCLUSION 


This chapter has outlined the workings of the speech production mechanism in a way 
that will enable you to follow the discussion in the rest of this book. For many users of 
the book, it will have served as a brief refresher course while for others, who may be 
new to the topic, it will have served as an introduction. In any event, it will probably be 
useful to refer back to this chapter whenever issues of articulation are discussed, in 
particular in chapters 2,5 and 11. 


THE INTERNATIONAL PHONETIC ALPHABET (revised to 1993, corrected 1996) 


CONSONANTS (PULMONIC) 
站 Bilabial | Labiodental 


Plosive | p b a, | 
Nasal m m| 


Trill 


| = Ff 
Tapor Flap | 
L 


Fricative 


Lateral 
fricative 
pa 


| Approximant 


Lateral 
| approximant 


Where symbols appear in pairs, the one to the right represents a voiced consonant. Shaded areas denote articulations judged impossible. 


CONSONANTS (NONPULMONIC) 


Clicks | Voiced implosives Ejectives 


f 

O Bilabial | D  pitabia i Examples: 

ie | Dental 
一 


Pp Biebia 


d 

$ t R DentaValveolar 
Palatoalveolar g Velar | K Velar 

G 


iE easa keat 


(Post)alveolar 


Uvular | S Alveolar fricative æ 


n Open aæ—— alp 


Where symbols appear in pairs, the one 
to the right represents a rounded vowel. 


OTHER SYMBOLS 
AA Voiceless labial-velar fricative È Z@ Alveolo-palatal fricatives 


W Voiced labial-velar approximant ji Alveolar lateral flap 


Voiced labial-palatal approximant f Simultaneous J and X 


Voiceless epiglottal fricative Affricates and double articulations 
can be represented by two symbols 


joined by a tie bar if necessary. 
Voiced epiglottal fricative 


中 Y S 


kp ts SUPRASEGMENTALS 
Epiglottal plosive p ~ a 
Primary stress 
ı Secondary stress 
founo" tifon 
P : Long e€: 
DIACRITICS Diacritics may be placed above a symbol with a descender, e.g. n 
P = a al = | ” Halflong €' 
| Voiceless n d H Breathy voiced b a | Dental t d | ~ < 
i k Lae a Extra-short © 
Voiced S t ~ Creaky voiced b a u Apical t d | Minor (foot) group 
spe ae us ? so 
_ oe E d j- Linguotabial t d p Lamina t d | Il Major (intonation) group 
Wm fl č i 
More rounded Q Labialized Nasalized e 。 Syllable break Ji.aekt 
Less rounded 2 J Palatalized t d! ji Nasal release dn ~ Linking (absence of a break) 
f pr: z ER y | 
Advanced u | Y Velarized tY dY |T Lateral release d! TONES EV ORD MORENTE OR 
p rs = 7 ae 5 oo 
_ Retracted è | Pharyngealized m7 d | EE Bo Ea A Rising 
四 Cemralized é | 7 Velarized or pharyngealized t © Taa © VY tee 
r3 x 一 -一 = x High 
Mid- centralized © Raised € ( I = voiced alveolar fricative) e 4 Mit e 4 tee 
一 | a è Jie e 4S, 
|, Syllabic n z Lowered eo B = voiced bilabial approximant) è 了 Exe é 4] Rising: 
e e | low falling 
Nonsyllabic ”名 |a Advanced Tongue Root ~ = | t Downstep 7 Global rise 
_Rhoticity e a whee Retracted Tongue Root € 2 | T Upstep N Global fall 
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2.1 INTRODUCTION 


In this chapter we will consider just what it means for languages to have structured 
sound systems. Also, we will see that although languages have different phonologies, the 
phonologies of different languages are in many respects very similar, and that much of 
the phonological structure of a language is in fact universal. In large part, these cross- 
language similarities are due to the ergonomics of the speech process: speakers, and 
languages, prefer distinctions that are easy to perceive and easy to produce. We will 
discuss these points mainly on the basis of the variation languages show in their seg- 
ments; but variation in syllable structure is also dealt with. Finally, we will see how this 
state of affairs defines the aim of phonological theory. 


2.2 _MORPHOSYNTACTIC STRUCTURE 


Phonology is concerned with a particular aspect of linguistic structure. In order to see 
what part of linguistic structure is phonological, we will first briefly consider that part 
of the linguistic structure which is not phonological, the morphosyntactic structure. 
The morphosyntactic structure of a language can be seen in the arrangement of the 
meaningful units of any linguistic expression. A distinction is made between morph- 
ology, which deals with the structure up to the level of the word, and syntax, which 
deals with the structure above the word. 


2.2.1 Morphological structure 


The smallest morphosyntactic unit is known as the morpheme. The word scratch is a 
single morpheme, but the word pens contains two morphemes. A morpheme that can 
be a word by itself is a free morpheme. The morpheme pen could form a word by 
itself, and is for that reason an example of a free morpheme, while the morpheme s is 
a bound morpheme, because it must combine with some other morpheme in order 
to be part of a word. A word consisting of a single (necessarily free) morpheme is a 
simplex word, other words being complex. An important group of bound morphemes 
are affixes, which can be divided into prefixes and suffixes, depending on whether 
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they are placed before or after what is known as the base. Thus, the s in our example 
is a suffix because it is placed after the base pen, and so is able in the complex word 
scratchable. 

Most words have a category membership, which allows them to be used in certain 
positions in sentence structure. If we assume that a sentence consists of a Noun and a 
Verb, in that order, then the sentence Pens scratch is well-formed, while *Scratch pens 
is an ill-formed sentence (as indicated by the *). Affixes are subcategorized for a cat- 
egory membership, meaning that they only attach to bases that are members of a par- 
ticular category. Thus, the plural suffix s attaches to nouns, while the suffix able attaches 
to transitive verbs, i.e. verbs that take a Direct Object, such as scratch, read and drink 
(but not sleep: *sleepable). Affixes are also distinguished on the basis of whether they 
create new words, in which case they are known as derivational affixes, or merely create 
new forms of the same word, like the plural suffix s or the past-tense suffix in scratched, 
in which case they are inflectional. Derivational suffixes come with their own category 
membership, which is frequently different from the category of their base. Thus, able 
has the category Adjective, and every complex word ending in able is therefore an 
Adjective. In (1), the structural information for these two affixes is given in a notation 
known as labelled bracketing. 


(1) a ,ll Jys) PLURAL 
b. [[ Ivirans able] 44) ‘is able to Verb’ 


Observe that a base can be simplex or complex. In scratchers, the plural suffix was 
attached to scratcher, itself composed of scratch and er. A simplex base which is a free 
morpheme is also known as a stem, while a simplex base that is not, like the English 
verbal base mit in transmit, is also known as a root. 


2.2.2 Syntactic structure 


Words do not directly enter into sentence structure. If the structure of our simple sen- 
tence S were simply [N V], it would only be possible to form sentences of the type 
Kittens scratch, Pens leak and Time flies. In fact, we can also form A good pen mustn't 
scratch or The pen I bought in Italy leaks. Here, the structural position of Pens in our 
original example sentence Pens scratch is occupied by the word groups A good pen and 
The pen I bought in Italy. This is the reason why, in between the structural level of the 
word, illustrated in (1), and the structural level of the sentence (or clause), we need 
to recognize the structural level of the (syntactic) phrase. That is, instead of [N V], 
our simple sentence has the structure [Noun Phrase - Verb Phrase], or [NP VP]. In 
addition to the NP and the VP, another frequent type of phrase is the Prepositional 
Phrase, or PP, which consists of a preposition plus an NP, such as with the leaky pen, on 
the surface or in Italy. 

The introduction of a level of phrasal structure allows us to point to two properties 
of morphosyntactic structure. One is that this structure is hierarchical. This means 
that we can distinguish higher and lower levels of structure: a word level, a phrase 
level and a sentence level. The second property is recursiveness. This means that a 
unit of a given structural level can be incorporated into a unit of the same structural 
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level, or that a unit of a higher level of structure can be incorporated into a unit of a 
lower level. For example, the sentence The pen I bought in Italy leaks consists of two 
phrases, the NP The pen I bought in Italy and the VP leaks. The first of these is made up 
of an NP The pen and an S (which) I bought in Italy, which sentence in turn consists of 
the NP J and a VP bought which (i.e. the pen) in Italy. The structure in (2) expresses 
this. In (3), we give the mini-grammar that will produce this sentence. Recursiveness 
is shown in (3b), first, by the fact that we can feed a higher-level S into a lower-level NP 
and, second, by the fact that we can feed an NP into an NP. As you will have guessed, 
we can — at least in theory ~ apply the same rules again and again. It may be fun to do 
this, as illustrated by the children’s verse This is the farmer that kissed the girl that chased 
the cat that killed the mouse, etc. 


(2) sl wel wel The pen] yp sl well] vp ye[ bought (sc. the pen) in Italy] yp ]s ]ypyp[leaks] yp ]s 
(3) a S: NPVP 
b. NP: NPS 


There is much that is universal about the syntax of the languages of the world, but, of 
course, there is also a good deal of variation. Some languages have a VP in which the 
Verb precedes the rest (i.e. [V NP] or [V PP]), while other languages maintain the 
opposite order (i.e. [NP V] and [PP V]). Moreover, some languages have a syntactic rule 
that moves the Verb to initial position in the sentence. This type of variation is often 
referred to as that between S(ubject) V(erb)O(bject), SOV and VSO. 


Q1 Identify the nouns and verbs in the following words: 


1. unscratchable 
2. road tax increase 


Q2 identify all the NPs in the following sentence: 
He wrote the letter with a pen he bought in Italy 


2.3 A WORLD WITHOUT PHONOLOGICAL STRUCTURE 


Spoken languages use human vocal sound to give shape to their morphemes. Phonology 
is the branch of linguistics that aims to describe the way in which this medium of human 
vocal sound is structured, in languages generally as well as in individual languages. To 
see what is meant by the sound structure of languages, it may be instructive to pretend 
for a while that languages do not have it. Imagine that every morpheme of a language 
were assigned some vocalization. Conceivably, these vocalizations could be quite lengthy 
in view of the large number that would be needed to distinguish all the morphemes of 
the language. But, importantly, there would be no implication that they should have 
structure in the sense that they are composed of subparts, any more than abstract 
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paintings are. To give a hypothetical example, vocalization (4) might be the morpheme 
meaning ‘oak tree’ in some language. 


(4) ‘oak tree’: high-pitched wheeze, trailing off into a voiced cough with central, nasal 
vowel quality 


The wide variation in morphosyntactic structure that is found in the languages of 
the world could exist without there being any structure to the human vocal sounds 
that languages use as a medium. Clearly, in the hypothetical situation described above, 
languages could still have SVO or VSO as their basic word order. They could either have 
an extensive Case system (i.e. be strongly inflectional) or have many prepositions; 
they may or may not have articles, they may or may not mark plural in both Subjects 
and Verbs (‘Concord’) and so on. Where our hypothetical situation is different from 
the situation in the real world is that nothing would have to be said about the sound 
structure, because there would not be any. All that anyone could do is make a list of 
descriptions like the one in (4). 


Q3 Why, in the hypothetical situation above, would it be impossible for speech errors like 


[klts kea] for kiss Claire to occur? 


Now let us turn to real life. The first observation to be made about the pronunciation 
of morphemes is that there are recurring elements. For instance, the sound patterns 
used for morphemes can be analysed as strings of segments. This is the basis on which 
we can say that English cat has the same segments as tack or act, although the segments 
occur in different orders. Minimally, then, the task of phonology is to state what the 
recurring elements are. 

The second observation to be made is that the recurring elements do not occur in 
all possible orders. For example, while cat, act, tack are possible combinations in English 
of the segments [k], [æ] and [t], the same does not go for [atk]. Notice that it is not 
the case that the sequence [tk] never occurs in English. This sequence is part of the 
word Atkins, for instance, and occurs in the sentence The cat killed the mouse. What 
this shows is that there is some constituent higher than the segment which imposes con- 
straints on what sequences of segments it may contain. In the example, that con- 
stituent is the syllable. That is, *[atk] cannot occur as a sequence of segments inside 
the syllable. In general, elements at one level of structure combine in restricted ways to 
form elements of a higher level of structure. We will see in Chapter 5 that the segment 
is to be looked upon as an element that itself combines elements of a lower structural 
level, called distinctive features. So a second task for phonological theory is to state 
what the permitted patterns of arrangement of the phonological elements are. 

A third important observation is that segments may be pronounced differently 
depending on their environment. This is because languages usually have processes 
that affect (‘change’) segments in particular contexts. In English, for example, [p,t,k] 
are aspirated when they are the first consonant of a syllable. Hence, tack is pronounced 
[tek], but stack is [stæk]. Similarly, the Dutch [z] in [ze:] ‘sea is pronounced [s] 


when it is preceded by an obstruent, as in [9p se:] at sea’. Or again, in French the 
final consonant of an adjective like [patit] ‘small’ is pronounced in le petit autobus‘the 
little bus’, but not in le petit camion ‘the little lorry’, as a result of the difference in 
the first segment in the following noun. As these examples make clear, one and the 
same morpheme may have different pronunciations in different contexts as a result of 
the existence of these processes. 

Phonological theory, then, will have to account for these three aspects of phonological 
structure. 


Q4 Do you think that sign languages for the deaf have phonologies? _ : 


2.3.1 One phonology for all languages? 


Now, it might have been the case that all human languages had the same sound struc- 
ture. In order to see what this would mean, lets once more enter into an imaginary 
world. We will make up a ‘universal phonology’, and then give partial descriptions of 
two — very different — languages. The universal sound structure is given in (5), and (6) 
gives partial descriptions of the two languages. 


(5) a. Segment inventory: 


b. Arrangement: 
A word consists of one, two or three syllables. 
A syllable consists of CV. 
c. Process: 
FRICATION Initial obstruents in non-initial words are fricatives. 


Because of FRICATION, [p,t,k] will be [f,8,x] and [b,d,g] will be [v,ð,x] in the context 
specified. Because the fricatives are variants of the plosives, they are not listed sepa- 
rately in (5a). 


(6) Language I Language II 
Syntax SVO VSO 
Subject-Verb Concord yes no 
‘cow’ pi kapu 
‘graze’ namu ni 
‘field’ diku pu 
Plural [[ ]NwvlulPLuR [[ ]NtipPuR 
‘in’ (Preposition) ma 一 
Locative Case 一 [[ Jx laltroc 


i 
a 


1035442 
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Using the data in the first column of (6), we find that in Language I the sentence 
meaning ‘Cows are grazing in the field’ would be [pilu namulu ma diku]. Here, 
Subject= Verb Concord is expressed by the double occurrence of [-lu], while the posi- 
tion of the PP ‘in the field’ is that of the O in SVO. 


Why does the word for ‘field’ in the sentence of Language | begin with [6], and not with [d]? 


What is the translation of this sentence into Language II? 
Why would the process we assumed in (5c) be a convenient feature of human language, 
as seen from the point of view of the listener? 


An important reason why a single phonology for all languages is an impossibility 
is that people use language to signal their social status. A speech community's social 
structure is typically reflected in its language in that different groups use slightly dif- 
ferent linguistic forms. Moreover, social structure is not static, and certain groups of 
speakers may want to distinguish themselves by starting new pronunciation trends, 
which may then be followed by ever-widening social circles around them, leading to a 
considerable degree of variability as well as change. Because geographical dispersion 
will inevitably slow down or stop the progress of such change, the result is typically 
that we observe complexes of linguistic differences associated with different regional 
dialects, or different but related languages. 


Q6 Japanese has the following processes: 


1. Close vowels [i,w] are devoiced to [i,w] between voiceless segments or between a 
voiceless segment and the word end. 

2. The alveolar plosive [t] is affricated to [ts] before [w]. 

3. The alveolar obstruents [t, s] are prepalatal [t/, f] before [i]. 


4. The glottal fricative [h] is pronounced as palatal [ç] before [i]. 


Give the pronunciations of the following words, as described by these four processes: 


[hibat] ` ‘brazier’ 
[twkemono] ‘pickled vegetables’ 
[isi] ‘doctor’ 


2.3.2 Two kinds of structure 


The fact that the vocalizations that are used to represent the meaningful units of lan- 
guage themselves have structure has an important consequence. This is that terms 
like ‘constituent or ‘unit’ are ambiguous. The speaker or writer might be referring to a 
morphosyntactic unit, such as ‘the morpheme hill or ‘the sentence 1 like it’, or to a 
phonological unit like ‘the segment [£] or ‘the syllable [ket]’. 

Ke rE? E S | 
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Q7 Divide the following English words up into (a) morphemes and (b) syllables: 


elephants 
palm oil 
unsettling 


In the case of English [æ], [k] and [t] all permitted arrangements are in fact words: cat, tack, 
act. But if we replace [æ] with [1], combining the three segments yields only two words: kit and 
tick. Would it be correct to say that [rkt] is an ill-formed combination in English? 

Can you explain why [mrat] would never be introduced as a brand-name in English, while 
[tram] might well be? 


Thus, a linguistic expression always has two structures, a morphosyntactic one, 
which reflects the meaningful elements in the expression, and a phonological one, 
which is the structure most immediately relevant to the pronunciation of the expres- 
sion. The distinction is a very real one, because the morphosyntactic constituents do 
not map one to one onto the phonological constituents; to use a technical term, they 
are not isomorphic. That is, morphemes do not exclusively correspond to segments, 
or exclusively to syllables, etc. While the Dutch polite second person pronoun consists 
of the one segment [y], the morpheme [tk] T consists of two. And while these two 
pronouns each consist of one syllable, the informal second person plural pronoun 
consists of two: [jy.li]. (The period‘. is often used to indicate syllable boundaries.) In 
English, the single syllable [su:z] can represent two morphemes, Sue and is as pro- 
nounced in the sentence Sue is ticklish. In (7), the morphosyntactic structure of that 
sentence is given. The way the sentence is analysed in morphosyntactic constituents 
should be compared with the (partial) phonological constituent analysis of that same 
sentence, given in (8). Notice that just like the morphosyntactic structure, the phono- 
logical structure is hierarchical. There is a layer of segments, which build a layer of 
syllables (symbolized o) which build a layer of feet (symbolized F), which build a 
layer of phonological words (symbolized œ), and so on. In addition to the different 
constituent structures for Sue is, for example, notice that ticklish consists of two sylla- 
bles as well as of two morphemes, but that the syllables and the morphemes do not 
divide the word up in the same way. The dual structure outlined in this section exists 
in all languages. It is dealt with in more detail in Chapter 15. 


(7) S 
NP VP 


ee eer 
Word Word Word 


a es 
Morpheme Morpheme Morpheme Morpheme 


| | 
sui Z tikl if 
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2.4 THERE'S A DIFFERENCE 


As you will have realized, not only do languages differ in their morphosyntax, but their 
phonologies are very different, too. To begin with, they differ greatly in the number of 
segments they have. In the UCLA Phonological Segment Inventory Database (UPSID), 
a corpus of segment inventories of 317 languages (approximately 6% of all the languages 
of the world), the smallest number appears to be 11 (e.g. Rotokas, spoken in Papua 
New Guinea) and the largest a staggering 141 (!Xt, spoken in Namibia and Angola) 
(Maddieson 1984: 9). And when two languages have the same number of segments, 
they are unlikely to have identical sets. Another language that, like Rotokas, has 11 seg- 
ments is Mura, which is spoken in Colombia. It shares [p,t,k,g,i,0,a] with Rotokas, but 
while Rotokas has [e,u] and the consonants [B] and a flap, Mura has [?,b,s,h]. 

Second, different languages will have different constraints on the way segments are 
combined to form syllables. The initial consonant(s) of the syllable are known as the 
onset, the vowel is in the peak and the closing consonant(s) form the coda. Coda and 
peak form a constituent called the rhyme (also spelled rime). In (9), we show these 
constituents in a tree diagram. 


(9) T 


ONSET PEAK CODA 


C- V -C 


According to Blevins (1995), the lowest degree of complexity in syllable structure is 
represented by languages that have a single (short) vowel in the peak and optionally 
allow maximally one consonant in the onset. The syllable structure of such languages 
is (C)V. Further complexity can be achieved in a number of ways: 


1. The Onset may be obligatory: CV. In such languages, of which Maba, Klamath 
and Arabela are examples, syllables cannot begin with a vowel. 


2. There may be a coda. Languages that don’t allow a coda include Hawaiian, 
Arabela and Fijian. If the language allows a coda, a further option is that the 
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coda may be complex, i.e. may be a cluster containing more than one C. Spanish, 
Japanese and Italian allow only one C, but Klamath and French allow two. 


3. The onset may be complex. Some languages allow only one C in the onset, like 
Finnish, Hawaiian and Klamath; others allow two or more, such as Arabela, 
Dutch and Spanish. 


4. The peak may be complex, i.e. be VV. This is the case if, by the side of short 
vowels, the language has long vowels (in which case the peak is filled by a double 
occurrence of the same vowel, V;V;) and/or has diphthongs (in which case the 
peak is filled by two different vowels, V1V,). Spanish has the diphthong [ue], but 
no long vowels, while Wolof has long and short vowels but no diphthongs. Dutch 
and Finnish have both long vowels and diphthongs. Dinka and Estonian are rare 
examples of languages with overlong vowels, i.e. they have the peaks V, VV and 
VVV. Phonological vowel-length contrasts are known as quantity distinctions. 


Some languages allow the syllable peak, i.e. the structural position indicated by ‘V’, to 
be filled with a consonant. Thus, in Czech, the liquids [r,l] appear in the peak, as in 
the geographical names ['br.no], ['vl.tava], Brno, Vltava and American English 
has words like girl [gr:l] and mountain ['mauntn]. Where languages agree in the 
number of consonants they allow in the coda or onset, they will still differ in how these 
consonant positions can be filled. In German, [kn-, fn-] are possible onsets, as in 
['kna:.ba] ‘boy’, ['{ne.ka] ‘snail’, and so are [ps-, ks-], as in [psy.co.'lo:.g1J] ‘psycho- 
logical’, [ksan.tn] Xanten’, all of which are lacking in English. ([ts-] as in [tsa:r] ‘Czar’ 
is usually interpreted as a single segment, an affricate.) Conversely, English has [lj-] in 
lure, which is absent in German. An unexceptional implicational relationship is that 
the presence of a complex constituent implies the permissability of less complex con- 
stituents. Thus, a language that allows two consonants in the onset will also allow one 
consonant in the onset. 

Third, different languages will have different phonological processes. A process 
that occurs in British English but not in German is PREGLOTTALIZATION, by which [p,t,k] 
are [?p, ?t, ?k] when they occur at the end of a syllable before another consonant, as in 
ripped, mats, thickness. And an example of a process in German that does not exist in 
English is FINAL DEVOICING, by which all obstruents (i.e. plosives and fricatives) are 


Q9 The following words, taken from Huisman et al. (1981), illustrate the syllable structure 
of Angaatiha (the period separates the syllables): 


ka.mo.ai ‘him’ 

a.ti.?a.ra ‘thunder’ 

ai.n.ta.?0 ‘bird type’ 
ma.nji.njai ‘children (Objective) 
ta.m.pwai.?0 ‘lizard type’ 


Give the Angaatiha syllable structure as a CV-formula, placing optional elements in brackets. 
Does the language have syllabic consonants? 
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Can you think of (a) a segment which exists in English but not in your own language, and 


(b) a segment that exists in your own language, but not in English? If your native language 
is English, answer the question for any foreign language you are familiar with. 
What would the final consonant of the German word for Kiev be? Why? 


voiceless when they occur at the end of a syllable. This rule is responsible for the fact 
that in German no voiced fricative or plosive appears at the end of a word, even though 
voiced obstruents occur in the inflected forms of such words. For instance, the nomi- 
native form for ‘dog’ is [hunt], while the genitive form is [hundas]. Many languages 
have a rule of FINALDEVOICING, including Polish, Dutch and Catalan. 


2.5 UNIVERSALS AND IMPLICATIONAL RELATIONS 


While it is clear that different languages may have very different phonologies, at the 
same time it is clear they have many things in common. For a start, all languages have 
syllables, and all segment inventories can be split into consonants and vowels. All 
consonant inventories include voiceless plosives, i.e. all languages have at least two of 
the three consonants [p,t,k]. Then, there are near-universals. For instance, only two 
languages in UPSID, Rotokas and Mura, have no sonorant consonants. Or again, all lan- 
guages in UPSID except Hawaiian (i.e. 99.8%) have some kind of [t], and 91.5% have [i]. 
It is also striking that the coronal place of articulation is much commoner generally, and 
also shows more subdivisions, than either the labial or the dorsal places of articulation. 

When the group of more common segments is compared with the group of unusual 
segments, there are two observations to be made: 


1. Unusual segments tend to occur in larger segment inventories. For instance, 
an unusual segment like [kw], a [k] with rounding and aspiration, typically 
occurs in languages with large consonant inventories, like that of Igbo, which 
has 20 pulmonic egressive plosives in addition to three implosives, or Haida, 
which has 46 consonants in all. 


2. Unusual segments tend to be phonologically more complex than common seg- 
ments. For example, a common segment like [k] (99.4%) just involves a complete 
closure between the back of the tongue and the soft palate. The articulation of [k] 
allows organs of speech other than the back of the tongue to take the line of least 
resistance, requiring no accompanying actions of the vocal cords (like aspiration, 
or voicing during the closure, or a glottal closure), of the lips (lip rounding) or of 
the front of the tongue (palatalization). On the other hand, an unusual segment 
like [kwh] requires the same oral closure as [k], but in addition has aspiration as 
well as lip rounding. The relationship is not absolute, however. Although [6], for 
instance, is phonologically simple, it is nevertheless a rare segment. 
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What do these facts suggest about the phonological structure of language? The first fact 
suggests that languages ‘build up’ their phonologies in an ordered fashion. It suggests that 
a language will only have segment X if it already has segment Y. There are in fact many 
such implicational relationships, as Roman Jakobson had already taught the world in 
1928 (Jakobson 1990, chs. 9, 10, 18). Thus, no language has a voiceless nasal without also 
having its voiced counterpart, and no language has [z] without also having [s]. With very 
few exceptions, languages do not have front rounded vowels ([y,@]) if they do not also 
have front unrounded ([i,e]) and back rounded ([u,o]) vowels. As will be clear, most of 
these implications are tendential, and true only in a statistical sense. For instance, the 
presence of [f] generally implies the presence of [p], but Chuave is an example of a 
language for which this is not true. Apparently, there is no such thing as an absolute order 
in which languages avail themselves of the universal phonological resources. 

The second fact suggests that one way in which languages construct their segment 
inventories is by adding elements to already existing segments. If we continue to use 
the metaphor of ‘building up’ the segment inventory, it is as if you begin by making 
some choices from a collection of fairly run-of-the-mill segments, and then, as you 
require more of them, select further elements which you can use to create more seg- 
ments. For example, many languages have [p,t,k], while a smaller number have 
[p,t,k] and [b,d,g]. These latter languages can be seen to have increased their inven- 
tory by adding the element ‘vibrating vocal cords’ to the plosive segments they already 
had. Then, a smaller number still has these, plus [ph, t, kt], for instance. Here, the lan- 
guage can be seen to have added ‘aspiration to the [p,t,k] it already had. (But, again, 
instead of ‘aspiration’, languages may employ other elements that serve to expand 
the segment inventory [p,t,k; b,d,g].) 


Q11 The mean number of consonants in UPSID is 22.8, with a range of 6-95. The mean 
number of vowels is 8.7, with a range of 3—46. In what respects are the following 
languages atypical? 


'XG has 95 consonants and 46 vowels. 
Pawaia has 10 consonants and 12 vowels. 
Haida has 46 consonants and 3 vowels. 
Norwegian has 22 consonants and 19 vowels. 


As Jakobson (1968) emphasized in a famous monograph, children learn the sounds 
of their language in a particular order, regardless of the language, which reflects the 
scale of complexity implied in this paragraph: the unusual tends to be rare and tends 
to be acquired late. Thus, the metaphor of ‘building up’ the phonology in a-sense 
comes true every time a child learns a language. 


2.5.1 Plain or special? 


A third observation is made by Maddieson (1984): the number of vowels and the 
number of consonants are positively correlated. That is, when one language has more 
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consonants than another, it is likely that it also has more vowels. This suggests that 
languages that are more complex in one area of the phonology (in this case, the consonant 
inventory) also tend to be complex in other areas (in this case, the vowel inventory). It 
is apparently not the case that complexity in one area is somehow compensated for by 
simplicity in other areas: phonologies differ in complexity. For instance, languages with 
larger inventories tend to have more complex syllable structures. That is, languages 
that allow only simple syllable structures like CV tend to have smaller inventories than 
languages that, like English and German, allow up to three consonants at the beginning 
of the syllable (cf. Eng [stro:] straw, German [ftro:] ‘straw’), as well as more than one 
at the end (cf. English [lamp] lamp). In tone languages, the number of different pitch 
patterns typically lies between two and six, but the number of such patterns does not 
appear to be greater in languages with smaller inventories or in languages with sim- 
pler syllable structures. UPSID does not list permitted syllable structures, or prosodic 
distinctions, so the facts here are not easy to give. Maddieson (1984: 22), however, 
does present the total number of possible syllables in a small corpus of languages. The 
total number is determined by (a) the number of different vowels and consonants, (b) 
the permitted syllable structures and (c) the number of tonal (prosodic) distinctions. 
If complexity in one area of the phonology goes hand in hand with complexity in 
another, then the greater the number of possible syllables in a language, the larger 
should be the number of segments, and the larger should be the number of permitted 
syllable structures, as well as the number of prosodic distinctions. In the representa- 
tive list of languages Maddieson gives, this is indeed the case. To give some idea of the 
multiplicative effect of these three factors: the language with the largest number of 
different syllables in Maddieson’s minicorpus (Thai) has 146 times as many possible 
syllables as the language with the smallest number (Hawaiian). 


Q12 


In a corpus of 10 languages, the number of segmental contrasts was found to be related to the 


mean word length expressed as the number of segments in the word, in the sense that the 
smaller the segment inventory was the greater was the mean word length (Nettle 1995). 
Speculate on the cause of this negative correlation. 


2.5.2 Avoiding complexity 


There is another, somewhat more tenuous, relation worth drawing attention to. 
Unusual segments are not just unusual because they occur in relatively few languages, 
they also tend to be less frequent in the languages that have them. That is, as languages 
increase their phonological resources, they tend to do less with them. For instance, 
Dutch has front unrounded, back rounded and front rounded vowels. First, the set of 
front unrounded vowels contains one segment more than either the front rounded or 
the back rounded sets. Then, in almost every case, the frequency of occurrence of the 
front rounded vowels is lower than those of the corresponding front unrounded or back 
rounded vowels. The figures in Table 2.1 are based on the frequencies of the vowels in 
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Table 2.1 
Frequency of occurrence (%) of Dutch front unrounded, front rounded and back rounded vowels 
(van den Broecke 1976) 


Front unrounded Front rounded Back rounded 
Tense, short i 2.00 y 0.39 u 0.56 
Lax, short I 2.79 Y 0.48 2 2.50 
Lax, short [3 4.54 
Tense, long & 2.54 g: 0.06 0: 2.11 
Diphthong ti 1.93 ey 0.37 AU 0.35 


the 1000 most common Dutch words,expressed as percentages of the total number of 
segments. What can also be seen by comparing the figures in the second row with 
those of the first and fourth rows is that lax (short) vowels are more common than 
tense vowels. 


2.5.3 A word of caution 


You may by now have gained the erroneous impression that it is in fact easy to count 
the segments, or syllables, or prosodic distinctions, in a language. We hasten to dispel 
this notion: it is not easy to count these things at all. There are two reasons for this. One 
is that languages frequently have what is sometimes called marginal segments or pat- 
terns of arrangements (Moulton 1962). These are restricted to onomatopoeic words, 
in which the phonology echoes the meaning of the word, and (recent) loanwords. For 
instance, Dutch has a number of such vowels,among which [e:], which occurs in loans 
(e.g. [kre:m] ‘cream’) and in the onomatopoeic ['ble:ra] ‘cry, baw’. Should it be counted 
as a Dutch vowel? Or again, no Dutch words begin with [fj-], except [fjort] ‘fjord’. Is 
[fj-] a Dutch onset, or is it a Norwegian onset which speakers of Dutch have taken in 
their stride? Depending on the answers to such questions, the number of Dutch vowels 
will vary between 16 and 25. The second reason why it is difficult to count these things 
is that the count will depend on the analysis. Suppose a language has the five vowel 
qualities [i,e,a,o,u]. Suppose further that they can be either long or short and, more- 
over, that every combination occurs as a diphthong. The number of different vowels in a 
sense is five, but in another sense it is 30. UPSID would in a case like this give the num- 
ber as 5, but if, say, the combinations [ae], [uu], [eo] had been missing from the lan- 
guage, then, paradoxically, the number of vowels would have been given as 27. Because 
of the large number of languages involved, the trends noted in UPSID are unlikely to 
depend very much on how these decisions are arrived at. What is important is that you 
should not get the idea that it is easy to say what a segment is, and that counting things 
is no problem. 


2.5.4 Speech ergonomics 


Clearly, languages somehow monitor the development of their phonologies, and check 
segments and inventories off against two very general guidelines: ‘Don't make things 
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difficult for the speaker’ and ‘Don’t make things difficult for the listener’. That is, the 
best systems are those in which contrasts are maximally distinct with the least amount 
of articulatory effort (Flemming 1995, Boersma 1998). The reasons for specific statis- 
tical tendencies may therefore be either articulatory or perceptual. In some cases, the 
speaker’s and the listener’s interests may go hand in hand, but often the explanation of 
the statistical fact lies either in the speaker’s interest or the listener's. 


l. 


Plosives are more common than fricatives. Plosives require a brief closure of 
the oral tract in order for air pressure to build up behind it. This is all it takes 
to create a brief friction burst at the release. By contrast, fricatives require con- 
siderable airflow for the full duration of the consonant to keep the air turbu- 
lence going and produce audible friction. The speaker’s interest here favours 
plosives over fricatives. 


Voiceless plosives are more common than voiced ones. With every opening 
action of the vibrating vocal folds, the speaker releases some air from the trachea 
into the vocal tract. Because the vocal tract is closed off during the closure phase 
of the plosive, the air pressure in it will rise until it equals the air pressure below 
the vocal folds that drives their vibration. Just as we stop pushing the lever of a 
bicycle pump when the tube is full, so we will avoid continuing the vocal fold 
vibration much during the closure stage of the plosive. No such ergonomic con- 
flict arises during voiceless plosives, there being no requirement for any airflow 
during an open glottis. Again, the speakers interest is at stake here. 


Voiceless fricatives are more common than voiced fricatives. The relatively low 
airflow that results from the release of a rapid series of small air puffs into the 
vocal tract during vocal fold vibration conflicts with the requirement of a gen- 
erous airflow through the narrowed passage where the fricative is articulated. 
As a result, voiced friction is weaker than voiceless friction. 


Front rounded and back unrounded vowels are less common than either 
front unrounded or back rounded vowels. When you place an empty bottle 
under a running tap, the resonance in the bottle set up by the jet of air hitting 
the bottom of the bottle or the surface of the water already in it, increases from 
low pitched to high pitched. Starting from [u], you can reproduce this effect 
by moving into the articulatory position for [i]. At [u], the lips are rounded, 
pouted even, so as to increase the distance between the raised back of the tongue 
and the aperture at the lips, so as to create the longest distance between them. 
But when saying [i], we spread the lips, so as to shorten the distance between 
the raised front of the tongue and the lip aperture. While the articulation of [i] 
corresponds to the fullest state of the bottle and that of [u] to the most empty 
one, front rounded [y] and back unrounded [w] represent intermediate pos- 
itions, which are less likely to be used than the perceptually more extreme 
vowels. Here, the hearer’s interest appears to be the decisive factor. 


Ease of production is most clearly seen in the tendency for particular articulations 
to persist. Prenasalized stops, for instance, always have the same place of articulation 
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for the nasal and the oral stop, as in labial, coronal and dorsal [mb,nd,ng]: segments 
like *[mk] or * [nb] have not been attested. Similarly, nasal consonants in the coda tend to 
share the place of articulation of the following plosive, as in Japanese [rampw] ‘lamp’, 
[jonda] ‘read’ and [mangan] ‘manganese’. And probably all languages have rules of 
assimilation, which cause some articulatory feature of one segment to be transferred 
to an adjacent one, such as when English in is pronounced [1m] in in Paris or [19] in in 
Copenhagen (see Chapter 11). While for the speaker there are obvious advantages in 
extending the scope of an articulatory gesture, for the listener it is better to be able to 
hear differences. Paradoxically, therefore, there is also the tendency for languages to 
avoid repetition of the same thing. For instance, many languages that have labialized 
consonants lack labialized labials. Thus, Bakairi has [t™, k", d”, g™], but lacks [p™, b”], 
even though the nonlabialized plosive series shows no gaps (Wetzels 1997a). Likewise, 
there are many languages that allow complex onsets like [pn,kn] or [pl,kl] but disal- 
low [tn] and [tl], which latter combinations have a single place of articulation. In 
Chapter 9, we will introduce and exemplify the OBLiGATORY Contour PRINCIPLE, which is 
held to be responsible for this avoidance of repetition. 


2.5.5 System gaps 


We have seen that the segment inventories of languages tend to be constructed as if 
languages drew on the stock of phonological resources by adding elements to sets of 
segments. By adding to [p,t,k] the element ‘vocal-fold vibration’, you will produce the 
series [b,d,g]; further adding ‘aspiration’ will produce [p",t",k"], as in Burmese, for 
instance, and so on. Or if you have five vowels, you may allow them to be long as well 
as short, you may nasalize them, or provide them with pharyngealization. (This is part 
of the answer to the secret of the 46 vowels of !Xii.) If phonological resources are typ- 
ically made available per set of segments rather than per individual segment, as 
indeed shown by Clements (2004), you may well wonder why languages so often have 
system gaps. For instance, Dutch has the voiceless unaspirated series [p,t,k], but only 
the voiced [b,d]. So where is [g]? Dutch is not alone in having this gap. It is in fact the 
commonest gap among the voiced stops (e.g. Czech, Hixkaryana, Thai), just as [p] 
is the commonest gap among the voiceless stops, the latter being absent from lan- 
guages as diverse as Arabic, Chuave, Dizi, Hausa, Vietnamese and Yoruba. In the exam- 
ples given here, there appears to be a relation between the gaps and efficiency. The 
voiceless plosive [p] is relatively inefficient from the point of view of the listener, 
because the stop burst, which is one of the major cues to the presence of a plosive, is of 
much lower intensity in the case of [p] than in the case of other plosives, due to the 
lack of a resonating cavity in front of the point of release, where the burst is created 
(Stevens 1997: 494). The voiced [g] is relatively inefficient from the point of view of 
the speaker, because the relatively small air cavity behind the velar closure causes the 
air to accumulate below it, thus increasing the supraglottal air pressure and diminish- 
ing the glottal airflow, and thereby causing voicing to stop (Ohala 1989). That is, [p] 
is relatively hard to hear, and [g] is relatively hard to say. Again, while these statisti- 
cal tendencies are very clear, languages may deviate from them. Thus Hawaiian, 
quite exceptionally, has [p, k], but not [t]. In this case, an earlier stage of the 
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Q13 


in the above discussion on system gaps, the assumption was that [g] was the voiced stop 
whose place of articulation is closest to the larynx. Which voiced stop would you expect to be 
least frequent among languages that have plosive systems that include uvular stops as well as 
velar ones? Motivate your answer. 


On average, voiceless plosives are more frequent than voiced plosives, and coronal 
segments are more frequent than noncoronal segments. This holds true both for the 


occurrence of these segments in the inventories in the languages of the world, and for 
their occurrence in the words (or texts) of any individual language. To represent this 
situation graphically, draw a set of two coordinates, with frequency of occurrence of the 
y-axis and the three places of articulation LABIAL, CORONAL and DORSAL on the x-axis. Draw 
two theoretical graphs, one for the voiceless plosives and one for the voiced plosives. 
English [p,t,k] have frequencies of 1.78, 6.42 and 3.09, while [b,d,g] have frequencies of 
1.97, 5.14 and 1.05 (Gimson 1989: 219). In a set of two coordinates, with frequency of 
occurrence along the y-axis and the three places of articulation on the x-axis, draw two 
graphs, one for the voiceless plosives and one for the voiced plosives. Explain why the 
positions of [p] and [g] differ from those in the ‘theoretical’ graphs of the previous 
question. 


language did have [p, t, k], but somehow [t] was replaced with [k], after [k] had 
become [?]. 


2.6 CONCLUSION 


Obviously, the fact that languages show so many similarities in their sound structures 
cannot be accidental. Apparently, that structure is in part an inherent, innate’ property 
of human beings. And, as we have seen, the universally observed frequency distribu- 
tions appear to be reflected in the frequency of occurrence of segments in individual 
languages. That is, phonologies of different languages are variations on the same theme. 
Ideally, phonological theory provides a model that explains this situation. The most gen- 
eral statement of the aims of phonology is that it seeks to establish the ‘possible space’ of 
phonological structure, and show that the actual phonological systems we find in lan- 
guages fit into that space, while showing that nonexistent structures do not. There are, 
for instance, many processes, but the number of possible processes that are never 
attested is very much larger. By postulating an innate, universal element, it may become 
possible to explain why children learn phonological structures so quickly. That is, 
humans start out with a certain amount of ‘skeletal’ information, which they fill in with 
language-specific information on the basis of the language they are exposed to. With 


Conclusion 33 


regard to syllable structure, for instance, the innate information might be that there are 
syllables, and that syllables have peaks and onsets. What the child would want to know 
next is (1) whether the onset may remain empty, and/or (2) whether there may be a coda 
(Clements and Keyser 1983: 29; Kaye 1989: 56; Blevins 1995). 

A crucial assumption that underlies the notion of a phonological system is that the 
pronunciation of a language can be described with the help of a finite set of discrete 
constituents, i.e. segments, syllables, feet, etc. The pronunciation of every morpheme 
consists of a particular configuration of those constituents. These phonological con- 
stituents are meaningless, and distinct from the meaningful, morphosyntactic 
constituents of the language, such as morphemes, morphological words, syntactic 
phrases and sentences. Later, in Chapter 15, we will see how phonological structure 
continues above the level of the word. 


Making the form fit 


3.1 INTRODUCTION 


In Chapter 2 we saw that different languages have different phonologies. One of the 
clearest illustrations of this fact is provided by the adaptation of loanwords to the 
phonology of the borrowing language. In this process, speakers will interpret the pro- 
nunciation of the words of the foreign language in terms of the phonological elements 
of their own. The way in which they do this can tell us a great deal about the phonology 
of the speaker’s native language. For example, the French pronunciation [fi'lin] for 
English [‘fi:liy] reveals that French does not distinguish tense and lax vowels, and 
uses [i] for both [i:] and [1]. Second, it places the word stress on the last syllable, regard- 
less of where the stress was in the original word. In this chapter we will discuss the 
process of nativization, and illustrate it mainly on the basis of English loans in 
Hawaiian and one Indonesian loanword in Konjo. These languages have very different 
phonologies, the phonology of English being much more complex than that of 
Hawaiian, in particular. After showing how the pronunciation of foreign words is 
constrained - or shaped - by the phonological structure of the native language, it is 
pointed out that the phonological representation of native morphemes, too, may need 
to be adjusted. This need may arise when morphemes are combined. If a language 
with the syllable structure (C)V(C) only allows a coda consonant in word-final posi- 
tion, something will have to be done whenever a consonant-initial suffix like [ka] is 
attached to a consonant-final base like [taf], since *[tafka] would be ill-formed. In 
order to describe phonological adjustments, two approaches have been adopted: rules 
and constraints. The difference between these two approaches is briefly explained and 
illustrated. j 


3.2 HAWAIIAN 


First, we give a brief outline of Hawaiian phonology, based on Elbert and Pukui 
(1979). The syllable structure is characterized by the formula in (1). 


(1) (OV(V) 


Adjustment processes 35 


That is, syllables do not have codas, vowels can be long or short, and the onset contains 
maximally one consonant. The Hawaiian phoneme inventory is extremely small. It is 
given in tabular form in (2). The rows in the C-system stand for manners of articula- 
tion, usually in the order ‘plosive (and affricate)’, ‘fricative’, ‘nasal’, ‘approximant’, 
while the columns stand for places of articulation, ordered from labial to glottal. 
Voiceless and voiced obstruents are usually presented in separate rows. For vowels, the 
rows stand for degrees of tongue height, while the columns have the order ‘front 
unrounded’, ‘front rounded’, ‘back unrounded’ and ‘back rounded’. 


(2) V C 
i p k ? 
e h 
à m n 
w l 


Consecutive vowels sound like diphthongs if the second is higher than the first (e.g. [au] 
or [oi]), and like long vowels if they are identical (i.e. [aa] sounds like [a:].) In terms of 
(1), these vowel sequences are VV, and are monosyllabic. Other sequences of vowels are 
divided over two syllables. A [j] is inserted between [i,e] and a lower vowel, and [w] is 
inserted between [u,0] and a lower vowel. The latter consonant is indistinguishable from 
the unpredictable occurrence of the [w] listed in (2). A sequence like [ua] is therefore 
equivalent to [uwa], and [ia] is pronounced [ija]. Possible Hawaiian words are 
[iwa] ‘nine’, [ni?ihao] (geographical name), [honolulu] (geographical name), [aa] ‘jaw’, 
[?aa] ‘fiery’, [pua?ohi] ‘chatter’, [kaukau] ‘admonish’ and [ho?olaule?a] ‘celebration’. 
Impossible words are *[tu?a], *[plai] and *[kehunanal]. 


Q15 Explain why the last three items cannot be Hawaiian words. 


3.3 ADJUSTMENT PROCESSES 


3.3.1 The process of nativization 


A speaker of Hawaiian speaking English might well wish you [mele kelikimaka] on 
25 December. This utterance is the result of the interpretation of [meri krismas]. The 
situation exemplified here is representative of the sort of thing that happens when 
speakers of one language decide to speak another language without adopting any of 
the phonology of that other language. When faced with the task of pronouncing an 
expression in a foreign language while using only the phonology of their native lan- 
guage, speakers need to (a) interpret each of the segments in the foreign word in terms 
of the native segment system; and (b) make sure that no strings arise that break the 
syllable structure constraints or any other phonotactic constraints of their language. 
These two types of processes should be seen as different parses, according to Silverman 
(1992). The first parse takes place at the Perceptual Level: the acoustic input, or the 
acoustic image that the speaker has of the foreign word, is interpreted as a string of 
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ee ~ Aadne 


native segments. In the case of Merry Christmas, this process must have resulted in the 
segments in the second column of (3). The segments in the first column represent the 
English interpretation of this expression. 


(3) Input Perceptual Operative 


Level Level 
m m m 
€ e e 
J ji l 
i e e 
k k k 
e (cf.(1)) 
J l l 
I i i 
s k k 
i(cf.(1)) 
m m m 
a a 
s k k 
a (cf.(1)) 


Compared to the English segments, Hawaiian has phonetically quite similar segments 
available in most cases. The fact that there is no [r] type consonant accounts for the 
interpretation of [1] as [l], while [k], being the only lingual nonsonorant consonant, 
is the best interpretation of [s]. Now notice that this string of segments (those in the 
second column) is not well-formed. In particular, it cannot be analysed as sequences 
of (1), with or without the optional elements in that formula. The aim of the second 
parse, referred to as the Operative Level, is to make the string of segments perceived 
at the Perceptual Level conform to the phonotactic constraints of the language. Most 
importantly, the segments will have to be accommodated by giving them legitimate 
positions in syllable structure. Rather than throwing [I] in Christmas out, therefore, 
a vowel is inserted between [k] and [I]. As a result, both [k] and [I] are now single con- 
sonants in their onsets, as required by (1). Similarly, vowels are inserted after the sec- 
ond and third [k]. (The new vowel would appear to be a copy of a nearby vowel, but we 
will ignore this aspect.) 

Silverman's division of the process of nativization into a Perceptual Level and an 
Operative Level is convenient, and allows us to see the distinction between the phon- 
ological elements (vowels, consonants) and the phonological structure containing the 
segments. Respectively, they may be thought of as the things that need to be packaged 
and the legitimate ways of packaging them. However, we cannot really separate the 
two processes in the sense that the segments produced by the perceptual parse must 
be accommodated by the phonological structure at all costs: it is not always the most 
likely segmental interpretations that survive. Instead of supplying a vowel in order to 
make the segment string conform to the native syllable structure, the language might 
adjust the string of segments by replacing a consonant with one that is not too different 
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from it, but which can be accommodated without the addition. of a syllable. For 
instance, German does not allow voiced obstruents in the coda, which do occur in 
English. In (4),a [b] in the English input for a German loanword is ultimately replaced 
with a [p], because the presence of [b] is ill-formed in that position. A solution that 
the German speaker does not resort to is to supply a vowel after the [b] so that it can 
be preserved as an onset consonant. 


(4) Input Perceptual Operative 


Level Level 
P P P 
A a a 
b b p (to preserve final position of plosive) 


Another case of an alternative choice of consonant comes from Konjo. In this lan- 
guage a word must minimally consist of a foot, which in Konjo is disyllabic and has the 
stress on the first syllable. It has the vowels [i,e,a,0,u], and its syllable structure is 
(C)V(C). Word-finally, the coda can only be [?] or [p]. It has a full set of nasals 
({m,n,n,n]) as well as a lateral (Friberg and Friberg 1991). 


Q16 Why are the phonological representations [men] and [le?em] not possible Konjo 
words? 


Konjo has adopted the Indonesian word [lem] ‘glue (itself a loan from Dutch [leim]) as 
['le?en]. At first sight, this may be a little surprising. Note that [‘leme] would be a possi- 
ble word; in fact, the form [lame] is an actual word: ‘tuber’. The form ['le?en] conforms 
to Konjo phonology in that it is disyllabic and does not have [m] word-finally. Here, the 
Perceptual Level analysis must be [lem], but as in the case of the German loan from 
English, the first decision at the Operative Level was to replace an impossible coda con- 
sonant with a possible one that is similar: [m] is replaced with [ņ] (see (5)). 


(5) Input Perceptual Operative 
Level Level 
l l l 
e e e 
? (to serve as C in inserted CV syllable) 
e (to serve as V in inserted CV syllable) 
m m n (to preserve final position of the nasal) 


Incidentally, sometimes a segment that is present in the original form is not trans- 
lated in the loanword, something that is likely to happen if the segment is not partic- 
ularly salient. An example is the Cantonese word [lip] ‘lift’, an adaptation of English 
[lift]. Here, while the first three segments in the English original have been translated, 
the final [t] was simply left uninterpreted. In this section it has been shown that the 
pronunciation of foreign words is adjusted to the phonological structure of the native 
language. In the next section we will see that in the native phonology similar adjust- 
ment processes may occur. 
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Q17 To salvage consonants that would otherwise be illegitimate codas, Japanese 
provides the vowel [w] to allow them to be onsets, as in [tfi:zw] ‘cheese’ and 
[maswikw] ‘mask’. However, after [t] the vowel [o] is used, as in [to:suito] ‘toast’. 
Referring back to Q6, can you explain this fact? 


Q18 Here are some Japanese loans from Dutch (Vos 1963, de Graaf 1990). 


Loanword Dutch origin Gloss 

so:da so:da: ‘soda’ 

ko:hi: kofi ‘coffee’ 
korera xo:lara: ‘cholera’ 
merankori: me:lanxo:li ‘melancholy’ 
hipokonderi: ` hipo:xondri ‘hypochondria’ 
mangan manya:n ‘manganese’ 
bombon bombon ‘kind of sweet’ 
gomuw yym ‘rubber’ 

kari ka:li ‘potash’ 
kiniine kinina ‘quinine’ 
mesw mes ‘knife’ 
karwkw kalak ‘chalk’ 
maswto mast ‘mast’ 
buuriki blīk ‘tin’ 
orwgoirw oryal ‘pipe organ’ 
swkop:w sx3p ‘spade’ 
pompw pomp ‘pump’ 
piswtorw pisto:l ‘pistol’ 
gipwsw yIps ‘plaster cast’ 
kan kan ‘kettle’ 
rampu lamp ‘lamp’ 


List the consonants that appear word-finally in the Japanese words. 

List the consonants that can appear in the coda of nonfinal syllables. 

Why is a vowel added after the word for ‘rubber’, but not after the word for 
‘manganese’? 

On the basis of these data, what would you say are the voiceless fricatives of 
Japanese? 

What evidence is there that Japanese does not allow CC-onsets? (In fact, Japanese 
allows CC-onsets, provided the second C is [j], as in the geographical name 
[kjo:to].) 
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Q19 Study the following Hawaiian loanwords from English carefully. 


English Hawaiian English Hawaiian 

‘elbat ?alapaki ‘Albert’ wain waina ‘wine’ 

‘trkrt kikiki ‘ticket’ rars laiki ‘rice’ 

saup kopa ‘soap’ bel pele ‘bell’ 

bia pia ‘beer’ ‘flava palaoa ‘flour’ 

kilt kiliki ‘kilt 'zəudiæk kokiaka ‘zodiac’ 

braf palaki ‘brush’ Qavzand kaukani ‘thousand’ 

Stri kole ‘story’ pa:m paama ‘palm’ 

sku:l kola ‘school’ _‘slafant ?elepani ‘elephant’ 

1. List the onset and coda clusters in the input forms that are broken up into different 
syllables in the Hawaiian output forms. 

2. List the consonant clusters that are not so broken up, i.e. that are not fully interpreted. 

3. List the English consonants for which Hawaiian [p] is used. What articulatory aspect do 
these consonants have in common? ; 

4. List the English consonants for which Hawaiian [k] is used. What articulatory aspect do 
these consonants have in common? 

5. Can you make a guess as to what the nativized Hawaiian form for English false might be? 

6. If someone asked you why the Hawaiian speaker shouldn't simply say [s] instead of {k] 

in words like soap and Christmas, what would your answer be? Would it be correct to answer 

that the fully native speaker of Hawaiian cannot hear the difference between [{s] and [k]? 


3.3.2 Adjustments in the native vocabulary 


It may seem self-evident that the phonological shape of the morphemes of a language 
will conform to the structural constraints holding in that language. By and large, 
this is true: a language that disallows complex onsets will typically not have words 
with complex onsets. Yet it frequently happens that ill-formed structures arise in 
native forms. For one thing, the phonological shape of morphemes that cannot by 
themselves be words, like affixes, need not conform to the constraints on syllable 
structure. An affix may consist of a single consonant, for instance, and as such cannot 
be a good syllable. Usually, well-formed syllables are only required at the level of the 
word. More generally, when suffixes are attached to bases to form complex words, or 
when words come together to form phrases, there is no guarantee that the phonologi- 
cal shape of the combination is well-formed, and adjustments are therefore frequently 
called for. For example, as will be seen in Chapter 4, the form of the English nominal 
plural suffix is [z]. This particular phonological representation is fine when the mor- 
pheme attaches to eye to form [arz], but it cannot be used in that same form in com- 
bination with either nose or back. In the first case, a vowel is inserted between the stem 
and the suffix, to form [nouziz], as the form *[n9uzz] is ill-formed; in the second case, 
[beks] is formed, it being impossible to have two adjacent obstruents in the same syl- 
lable that differ in voicing (*[bakz]). As a result, the suffix [z] has three different pro- 
nunciations, or morpheme alternants, [z], [s] and [1z]. 
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3.4 TWO APPROACHES 


There have been two approaches to the question of how phonological adjustments of 
(native) morphemes should be described, one based on rules that change representa- 
tions and one based on constraints that require representations to have certain forms. 


3.4.1 Rules 


The traditional approach, associated with the monumental work by Noam Chomsky 
and Morris Halle, The Sound Pattern of English (SPE) (1968), uses rules that change the 
phonological representation of the morpheme in particular phonological contexts. 
These rules are ordered: each rule except the first applies to the output of the preceding 
rule. After all the appropriate changes have been made, the correct form surfaces. This 
is a derivational approach, in the sense that the surface form of the expression is 
derived in a series of structure-changing operations from the underlying form. 

In the case of the English plural suffix, the derivational approach postulates two 
rules. One, I-INSERTION (6), inserts a vowel. between two sibilants ([s,z,J,3]). The other, 
DEVOICING (7), devoices an obstruent after a voiceless obstruent in the same syllable. 


(6) ”LI-INSERTION: Insert [1] between two adjacent sibilants in the same word. 
(7) _ pevoicine: A voiced obstruent becomes voiceless after a voiceless obstruent. 


The adjustments are given in the derivation (8). The first line represents the underlying 
forms and the last line the surface forms. Intervening lines show the working of the 
phonological rules, in the order in which they apply. Thus rule (6), I-INSERTION, does not 
apply to [bak-z] and [ar-z] because these forms have no instances of adjacent 
sibilants, but it does apply to [kis-z], which contains a sequence of [z] and [s] in a 
single word. Non-application is sometimes explicitly indicated by (n.a.), as has been 
done here. 


(8) Input representations bek-z kis-z al-Z 
Rule (6) (n.a.) I (n.a.) 
Rule (7) s (n.a.) (n.a.) 
Output representations beks kisiz alz 


An important advantage of applying rules in sequence is that they can express gener- 
alizations in simple ways. Notice, for example, that DevoiciNG can be formulated in the 
simple way that is has been in (7), thanks to the work of 1-INSERTION. If we were to apply 
DEVOICING before 1-INSERTION, it would change the [z] after [kis] into [s], because it 
appears after a voiceless obstruent in the input. By first inserting the [1], the [z] is no 


longer adjacent to [s], and is skipped by pevoicine. We will discuss this point further in 
Chapter 6. 


3.4.2 Constraints 


Over the last years, a different approach has been taken to the adjustments. In this 
constraint-based approach, demands are put on the surface form, and any form that 
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does not comply with these constraints is rejected in favour of a form that does. The 
most successful constraint-based theory is Optimality Theory (Prince and Smolensky 
1993, McCarthy and Prince 1993a, Kager 1999). This theory holds that constraints are 
universal. There are two important features of the theory that explain why languages 
nevertheless have different phonologies. First, languages differ in the importance they 
attach to the various constraints. That is, the phonology of a language is given by the 
ranking of the set of universal constraints, known as that language’s constraint hier- 
archy. Second, constraints may be contradictory, and thus be violated: if two con- 
straints are contradictory, the one that is ranked higher will have priority. 

A constraint-based approach to the English nominal plurals might postulate con- 
straints (9) and (10). According to (9), sequences of sibilants are excluded within the 
same word. And according to the constraint in (10), we cannot have a sequence of 
obstruents that differ in that one is voiced and the other is voiceless. Thus, a form like 
*[nəuzz] violates the first constraint, and a form like *[bekz] the second. 


(9)  *SipSis: Sequences of sibilants are prohibited within the word. 
(10) *avorce-Gvoice: Sequences of obstruents within the syllable must agree for voicing. 


So how does Optimality Theory determine what the output form must be? For any 
given input form, there will initially be an unlimited set of output forms. This free 
generation of potential output forms is taken care of by a function called Gen (for 
‘Generator’), which is subject only to very general constraints of well-formedness. 
Let us illustrate this with the plural for [kis]. For an input form [kis-z], some of 
the generated outputs will be: [kisIz], [k1sz], [kıs], [kızz], but also forms like [pets] or 
[tæpt]. There are two general forces at work that determine which of these numerous 
potential output forms is chosen by the language. One of these forces is called faith- 
fulness: it is the force that tries to make the output form identical to the input form. 
Thus, if English were completely faithful, the plural of kiss would be [kisz]. The other 
force might be said to be the unmarked way of pronouncing things. If this force were 
allowed to have its way, unchecked by any other force, all words in the language, or 
indeed in all languages, would end up as something like [ba], or perhaps [ta]: any- 
thing more than this would be more ‘marked in the sense of less common, more com- 
plex and more difficult to pronounce. In reality, the outcome is determined by how 
these two forces interact. Each of the forces is represented by a set of universal con- 
straints, and every language ranks these constraints in its own way. Again, if all the 
Faithfulness constraints are ranked above all the phonological constraints, no phono- 
logical adjustments will be made to the input form. However, typically one or more 
phonological constraints are ranked above one or more Faithfulness constraints, 
which means that in the case of a conflict, the phonological constraint wins. Every 
constraint that is inspected will thus throw out a number of candidate forms, and this 
process goes on until there is only one form left. Optimality Theory thus holds that the 
output form is the optimal form, the form that is left as the only survivor of all can- 
didate forms after an inspection of the constraint hierarchy. 

McCarthy and Prince (1995) propose three important constraints to express 
Faithfulness. Max-IO requires that each segment in the input form (T) has a 
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corresponding segment in the output form (‘0’). That is, the input is ‘maximally’ repre- 
sented in the output, and the constraint is therefore violated if a segment is deleted. 
Dep-IO requires that each segment in the output form has a corresponding segment in 
the input form. That is, the output must be entirely ‘dependent on the input, and the 
constraint is violated by any inserted segment. Third, Ipenr(F) requires that every fea- 
ture (‘F’) of the input segment is ‘identical’ to every feature in the output segment. That 
is, this constraint is violated if a segment changes from voiceless [t] to voiced [d], say, 
or from bilabial [m] to dorsal [n]. (The theory allows these constraints to be split up 
into detailed subconstraints, and they are therefore better seen as constraint families.) 
There are many phonological constraints, some of which we will present informally 
below. It will be clear that the output form will be as close as possible to the input form, 
and that every deviation must be forced by some higher-ranking phonological con- 
straint. It will thus be clear that an output [tæpt] for an input [kisz] is unlikely to 
survive an evaluation by the Faithfulness constraints, which will quickly see to it that 
[tæpt] is discarded in favour of forms that make a better job of preserving the input. 


(11) Max-IO: Deletion of segments is prohibited. 
(12) Der-IO: Insertion of segments is prohibited. 


(13)  Ipent(F): A segment in the input is identical to the corresponding segment in 
the output. 


The operation of evaluating the collection of possible output forms is called Evar (for 
‘Evaluation ). This evaluation is shown in tableaux. The tableau in (14) will serve as an 
illustration. The constraints are arranged in the columns, and the forms to be evaluated 
are arranged in the rows. The input form to be evaluated is given in the top left corner. 
A * ina cell indicates that the form of that row breaks the constraint in that column, and 
*! indicates that such a violation eliminates that form from further consideration: the 

violation is fatal. The optimal form, the winner, is marked sw. Shaded cells indicate that 
the constraint in that column has become irrelevant to the fate of the form in the row 
concerned. As is shown in (14), it is more important in English to obey *SisSis than to 
obey Dep-IO: in order to prevent the adjacency of [s] and [z], a segment [1] is inserted 
between them. However, as shown in tableau (15), the language is less concerned about 
*avoice—avoice: although it could have saved the voiced [z] from ‘becoming’ [s] in 
[bækz] by inserting a vowel to form [bækız], it chooses not to break Dep-I0 for this pur- 
pose. To satisfy *avoice—avoice, it is however prepared to violate Ipent(F) in the matter of 
the voicing of [z]: the input [z] corresponds to a non-identical output [s]. 


rake nsf 


(14) 
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(15) *SibSib *avoice—avoice Ident (F) 


Incidentally, in addition to (9) and (10), there would have to be a constraint requiring 
that the phonological content of the stem must be retained. Otherwise, *[bægz], which 
satisfies *avoice—avoice, would also be described as a correct form, by the side 
of [baeks]. 


Q20 The form *[kis] is an incorrect output form for the plural kisses. 


1. Which constraint (not listed in tableaux (14) and (15)) is responsible for ruling this 
form out? 

2. Draw a tableau for the input form [kisz] (i.e. the plural of kiss), with the two potential 
output forms [k1z] and [krsiz]. There should only be two constraints in your tableau: 
Der-10 and the constraint you gave as the answer to the previous question. Would you 
rank the latter constraint above or below Dep-l0? Motivate your answer. 


Q21 The consonant inventory of Mauritian Creole contains the coronal fricatives [s,z]. 
Palatoalveolar fricatives are absent. French words containing such fricatives, like 
[fe'va] ‘hair’ and [zeneral] ‘general’, are adapted as [seve] and [zene'ral], respectively. 
Assume the two constraints IDENT(F) and *J/3. Draw a constraint tableau with the 
correct ranking of these two constraints which shows the fate of the possible output 
form [seve] and [Jeve] in Mauritian Creole. 


3.5 CHOOSING BETWEEN RULES AND CONSTRAINTS 


You may now think that there is not much difference between the two approaches. 
There are, however, two important differences. First, in a derivational approach, con- 
straints and rules will sometimes both be needed and, moreover, appear to do the 
same work. Crucially, a rule-based grammar will also need constraints to characterize 
the phonological well-formedness of morphemes that have only a single form. For 
instance, the constraint *avoice-avoice (10) is generally needed to characterize the 
well-formed syllables of English: *[æbs], [*æpz], *[zti:], *[sbi:] are ungrammatical, 
while [əks] and [dz] are fine, and are indeed used for the words ox and adze. 
However, because the form [beks] needs to be derived from [bek-z], a rule of pevoic- 
ING (7) is required to effect the change from [z] to [s]. That is, the rule-based approach 
needs both (7) and (10), even though they would appear to be describing the same 
regularity of English. This inherent drawback of the rule-based approach is known as 
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the duplication problem (Kenstowicz and Kisseberth 1977: 136).A constraint-based 
approach does not run into this problem: the constraint on English syllable structure 
will act as a condition on all forms, regardless of whether they are morphologically 
derived or not. 

A second difference concerns the phonological adjustments that only appear in 
loanwords. For the purposes of the native phonology, Hawaiian does not require a rule 
that breaks up consonant clusters, since all the morphemes of the language conform 
to the requirement of the structure (C)V(V): there simply are no morphemes that 
begin with consonant clusters. In order to account for the adjustment made when a 
word containing a complex onset like [kr] is adapted to the native phonology, the 
derivational approach will have to add such a rule to the phonology (in this case, one 
to change [r] into [I], and one to insert a vowel between the [k] and the [I]). In a 
constraint-based description, no such ‘extra grammar would need to be supplied. To 
characterize the native Hawaiian forms, a constraint forbidding complex onsets would 
need to be undominated anyway, as the language allows no exceptions. That is, in 
general, in an OT description, the constraints would be ranked so as to bring any new 
input forms in line with the structural demands of the language. 

To illustrate this latter point, consider again the Konjo word for ‘glue’, [‘le?en]. The 
input form is the Indonesian word [lem]. Among the generated output forms there 
will be forms like [lem], ['lemem], [le?], etc. Why isnt the output [lem]? This is 
because the language has phonological constraints that are ranked higher than 
the Faithfulness constraints. One such constraint is that the output should minimally 
be a disyllabic foot, a constraint that is undominated in the language, so that there are 
no exceptions. 


(16) | MinWorp: A word is minimally a (binary) foot. 


Since MinWord outranks Dep-IO, a disyllabic form will be preferred to the monosyl- 
labic [lem]. Let us suppose that we have evaluated the potential outputs up to a point 
where all monosyllabic candidates (e.g. [len], [lem], [le?]) have been discarded by 
MinWorp. There are two further constraints that are relevant at this point. One, 
CoDACONDITION (17), forbids the presence of consonants other than [0,?] in the coda. 


(17)  CopaConpition: A coda consonant is [?] or [p]. 


After inspection of CopaConpition, there are two reasonably faithful candidates left: 
['le.?en] and ['le.me]. (We will ignore the question why [?] and [e] are the best 
choices here for the inserted segments.) The second constraint that is relevant at this 
point is ALiGN-Stem-Rict (18), which requires that no segments should be added to, or 
removed from, the end of the input. Formally, the requirement is that the end of [lem] 
should coincide with the end of a syllable in the output, but we give a simplified for- 
mulation here. 


(18)  Aticn-Stem-Ricut: The last segment of the input corresponds with the last 
segment of the output. 
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In Konjo, ALIGN-Stem-RicuT outranks Dep-IO. The form ['le?en] passes the higher- 
ranked constraint, because the end of the last syllable, [?en], coincides with the end 
of the input form [lem], and ['leme] fails it, because the end of the syllable [me] 
does not. These correspondences are shown in (19). (We could of course, perversely, 
assume different correspondences. For instance, the output [m] in (19a) might be 
taken to correspond to the second [e] in the input, but that output form would be 
thrown out on the grounds of multiple violations of specific versions of Ipent(F). Not 
even the consonantal nature of [m] would have been preserved, and an assumption of 
a high-ranking Ipenr([consonantal]) would be enough to discard that output form.) 


(19) xole? en B ig 
| | | oa ae 
le m lem 

The fact that ['le?en] fails the lower ranking Dep-IO is of no importance: since there 
are no other candidates left, it is irrelevant whether the successful candidate breaks 
any constraints ranked below the constraint that eliminated the last rival form(s). 
Neither is it of any relevance that forms that failed to pass a constraint satisfy any 
lower-ranked constraints. The tableau in (20) schematizes the situation. 


| tem O Align-stem-right Dep-IO 


(20) 


In other words, the adjustment of Indonesian [lem] to Konjo ['le?en] is accounted for 
with the help of the same constraint hierarchy as is used for characterizing the native 
forms of the language (Yip 1993, Jacobs and Gussenhoven 2000). It would therefore 
appear that, at least in principle, OT scores over a description that uses structure- 
changing rules in two ways. First; OT can characterize the phonological grammar of a 
language in a way that avoids having to state the same information twice, once as a 
constraint on monomorphemic forms and once in terms of a rule that changes poly- 
morphemic forms so as to conform to that constraint. Second, to account for phono- 
logical adjustments that are only observed in the adaptation of loanwords (i.e. never 
in the native forms), a rule-based analysis must add rules to the native grammar that 
bring those adaptations about. By contrast, the constraints that in an OT analysis char- 
acterize the native forms should equally characterize the optimal form of any incom- 
ing loanword. 


3.5.1 Gradient violation and unranked constraints 


Sometimes, two forms break the same constraint, but one of them can be said to vio- 
late the constraint more than the other form. Also, it will frequently happen that two 
constraints are never contradictory and therefore that their ranking makes no differ- 
ence. We illustrate the first point with the help of a theoretical language, Un-Konjo, in 
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which the ranking of AuiGN-Stem-Ricut and DEp-IO is the reverse of that in Konjo. This 
has been done in tableau (21). Notice that inspection of the form ['le?en] leads to two 
violations of DEp-IO, compared with only one in the case of [‘leme]. Multiple (or gra- 
dient’) violation is shown by the number of stars in the relevant cells. Since the form 
[‘le?en], with two inserted segments, is a worse candidate for the purposes of this 
constraint than ['leme], which has only one inserted segment, it is eliminated. Thus 
[‘leme] is the winner in Un-Konjo, and ALIGN-STEM-RIGHT, along with all other lower- 
ranked constraints, is left uninspected. 


Let us go back to Konjo. Observe, again, that the potential output form *[le?em] is 
non-optimal because it breaks CopaConp. Like Aticn-Stem-Ricur, this constraint must 
therefore be ranked above IDENT(F), which requires that [m] remains [m]. However, the 
ranking of CopaConp with respect to the two constraints in tableau (20) is indifferent. 
We could either insert it before or after them. In tableau (22) we have placed it in first 
position; to indicate that its ranking is indifferent, it is separated from ArIGN-STEM- 
RiGHT by a dotted line. Likewise, while IDENT(F) is ranked below CopaConp and ALIGN- 
STEM-RIGHT, it does not interact with Dep-IO. 


CodaCond | Align-stem-right Dep-IO Ident(F) 


Q22 Draw two tableaux that are identical to tableau (22), except that CopaCono is, 
respectively, in third and in fourth position. Please motivate your answers to the 
questions below. : 


1. Is Dep-IO crucially ranked above CopaConn? 
2. ls CopaCono crucially ranked above IDENT(F)? 


These brief illustrations of the rule-based approach (section 3.4) and of a constraint- 
based approach (this section) give only the main ideas behind the derivational theory 
of SPE and of Optimality Theory. In later chapters we will illustrate both approaches 
more extensively. In Chapter 7 we present a description of the diminutive forms in two 


Conclusion 47 


dialects of Dutch in the derivational framework of SPE, and in Chapter 14 we give an 
account of the word-stress locations in a number of languages in the framework of 
Optimality Theory. 


3.6 CONCLUSION 


In this chapter we have seen that the phonologies of languages actively impose phono- 
logical adjustments on input forms. Such adjustments are most readily observable in 
loanwords. However, the need to make adjustments also arises in the native vocabu- 
lary of the language when morphemes are combined in words and phrases, since such 
combination may lead to the creation of phonological representations that are ill- 
formed. There are two ways in which phonological adjustments have been described. 
They can be described with the help of a series of rules, which successively change the 
representation so as to make it conform to the requirements of the language. Second, 
they can be described by means of output constraints that state what forms must look 
like. Optimality Theory is a constraint-based theory that postulates that a language 
can be characterized by a ranking of a set of violable, universal output constraints, and 
that the correct form is the form that violates the constraints least. 


Underlying and surface representations 


4.1 _ INTRODUCTION 


In this chapter we discuss the need for an underlying phonological representation of 
morphemes, in addition to the surface representation. We begin by showing that a good 
deal of the surface pronunciation of words is predictable, as a result of the fact that the 
pronunciation of segments varies with the phonological context they occur in. Such 
allophonic information is assumed not to be included in the underlying representa- 
tions of morphemes, because it can be automatically supplied in the surface representa- 
tion. We distinguish allophonic variation from situation-dependent ‘stylistic variation. 
Next, we point out that the need for an underlying representation of a morpheme can 
also be motivated on the basis of morphemes that have a number of context-dependent 
surface alternants, like the English plural suffix, with its alternants [s,z,1z]. Here, the 
argument is that the underlying form expresses the phonological unity of the mor- 
pheme’s variants. Third, the existence of an underlying representation can be motivated 
on the basis of the impossibility of expressing certain phonological generalizations with 
reference to the surface representation. We close the chapter with a consideration of the 
question of how the phonological representation of the underlying form is determined. 


4.2 ALLOPHONIC VARIATION 


The ‘segments’ referred to so far in this book typically display a good deal of variation, 
even in the speech of a single speaker. Some of this variation is apparently random, but 
much of it is systematic. Two factors are responsible for this intraspeaker variation. One is 
style, or the degree of formality of the speech situation. For instance, in American 
English, an utterance like Right! will frequently have an unreleased final plosive (symbol- 
ized by means of the symbol for the plosive and a superscript ["], as in [rart']), but in for- 
mal speech styles the [t] may well be released (so that you can hear a weakish [s]-like 
sound after it). In sociolinguistic studies, the stylistic variants [t"] and [t°] are accord- 
ingly seen as the possible values of a phonological variable (t), whose frequencies of 
occurrence are compared across different speech styles for different groups of speakers. 
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The second type of variation is due to variation in the phonological context. Such 
context-dependent variants are called allophones. The term phoneme is used to refer to 
the segment category that the various allophones are variants of. To distinguish the 
phoneme as a segment category from the individual allophones, it is often placed 
between slashes: / /. For example, in British English there exist two rather different pro- 
nunciations of the phoneme /I/. When it occurs in the onset, it is pronounced as an alveo- 
lar lateral approximant, perhaps with some slight palatalization, as in leek, follow, a 
segment which is known as ‘clear’. In the coda, as in ill, cold, its alveolar contact is accom- 
panied by retraction of the tongue body towards the uvula, a sound symbolized by [ł] and 
commonly known as ‘dark P. Again, in many languages, coronal consonants alternate 
with palatoalveolar consonants, whereby the palatoalveolar one appears before [i], or 
sometimes before other front vowels as well. Thus, in Korean, underlying [sipsam] is pro- 
nounced [fipsam] ‘thirteen’, while Igbo has [f] before [i,e] and [s] elsewhere, as shown by 
[ofiri] he cooked’, [ofere] ‘he said’ vs [osere] ‘he wrote’ and [sara] ‘he washed’ (Jones 
1967: 21). These examples illustrate the two kinds of phonological contexts that deter- 
mine allophony. One is a particular segment or segment group, and the other is a struc- 
tural position like the coda or the onset of the syllable. 

Although allophonic variation in different languages may show some similarity, as 
suggested by the example of [s] before front vowels, it is important to see that lan- 
guages will differ in the way they use segmental differences to differentiate between 
morphemes. That is, a given segmental difference may be phonemic in one language 
and allophonic in another. While the difference between [s] and [f] is allophonic in 
Korean, i.e. fully conditioned by whether [i] or one of the other vowels of the language 
follows in the word, in English the difference is contrastive: it is used to differentiate 
between morphemes. When a segmental difference is contrastive, there will typically 
be pairs of words that are distinguished only in that one has one segment where the 
other has the other segment. Examples of such minimal pairs in English are [sip - 
Jip] (sip - ship) and [li:s ~ li:f] (lease ~ leash). 


Q23 For each of the three following languages, say whether the difference between [r] 
and [r] is (a) allophonic, (b) phonemic or (c) stylistic. Motivate your answers. 
In some types of Southern Swedish, [R] always appears at the beginning of a syllable and 
[r] always at the end. 
In some types of Dutch, [r] is the usual realization in words like [proXramal] 


‘programme’. Some speakers of those varieties use this pronunciation in everyday 
life, but use [r], as in [pRoXRamad, when announcing programmes on national 
television. 

In Provençal, the word for ‘evening’ is [sero], and the word for ‘saw (noun)’ is [sero] 
(Jones 1967). 


Allophonic variation is entirely predictable. As such, there is no reason to supply allo- 
phonic variation in the lexicon. The only lateral segment to appear in the lexicon of 
British English is just plain [I]. In a rule-based description, a rule of 1-vELARIZATION 
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would be postulated that adds the velarization to the specification of this consonant in 
the context ‘end of syllable’. This is shown in (1) for the words leak and ill. 


(1) Underlying litk il 
]-VELARIZATION (n.a.) } 
Output litk it 


Allophonic differences typically arise because particular contexts invite particular 
adaptations in the production of the segment. As a result, allophones are in comple- 
mentary distribution. The phonological context in which [J] occurs in Korean, i.e. 
before [i], is the complement of the phonological context in which [s] occurs, viz. 
before the remaining vowels of the language: together they make up the phonological 
context in which the underlying segment [s] occurs. 


Q24 In Tolitoli, an alveolar lateral approximant [ij is in complementary distribution with a 
retroflex lateral flap, given as [L]. What determines their distribution (Himmelman 1991)? 


morogo ‘wash hands’ totitoli ‘Tolitoli’ 

utag ‘snake’ kikilo ‘firefly’ 

membembelan ‘to tremble’ mongiutan ‘to choke’ 

labia ‘sago’ lelembatan ‘to carry on the shoulder 
with a stick’ 


Q25 In Korean, [rj and [I] occur in complementary distribution. What determines their 
distribution? 


pal ‘foot’ ‘below’ 


talda ‘sweet’ ‘distance’ 
fadio ‘radio’ 

kul ‘orange’ ‘reaches’ 
solhwa ‘legend’ ‘person’ 


Q26 In Old English, [ç] and [x] are in complementary distribution. State the contexts in 
which each of these allophones occur. 


knict ‘boy’ fe:c8 ‘takes’ 

meax ‘manure’ bro:xta ‘brought’ 
fa:x ‘hostile’ neəlæ:çtə ‘approached’ 
li:çtan ‘alleviate’ bəxtə ‘bought’ 

uxt ‘daybreak’ hi:ç ‘high’ 


A complementary distribution of segments can also arise in a different way, such as 
when one segment happens to be restricted to the onset position and another to the 
coda position. In such cases, the segments are not historically related in the way that 
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contextual allophones like [s] and [J] are. It will then depend on the similarity of the 
two segments whether the phonology ~ and the speaker's intuition ~ treats them as 
contextual variants of the same underlying segment. A well-known case is presented 
by [ņ] and [h] in English. Because [h] may not occur at the end of a syllable and [p] 
may not occur at the beginning of a syllable, the two segments are in complementary 
distribution. (That is, [hep] ‘hang’ is a word, but neither *[teh] nor *[nat] could be a 
word.) There is general agreement that [h] and [9] are underlying segments, since 
there is no plausible way in which one can be derived from the other. 


4.3 TWO LEVELS OF REPRESENTATION 


Why do phonologists assume that there are two levels of representation, an underlying 
one and a surface one? Three arguments have been advanced. 


1. One argument is economy. Why supply allophonic information in the lexical 
entries if it can be stated in a set of allophonic rules that are valid for all mor- 
phemes of the lexicon? The strength of this argument has been called into ques- 
tion by Kenstowicz (1994a: 69), who points out that it is not self-evident that the 
descriptive economy achieved by having allophonic information supplied by 
rules should be reflected in the actual phonological representations in the men- 
tal lexicon. There is apparently no reason to assume that the brain could not 
store all that redundant information for each word in which it occurs. However, 
the issue here is not just storage capacity, but also, or even mainly, search time 
(Lahiri and Marslen-Wilson 1991). It may be expected to be more difficult to 
retrieve the correct phonological form from a set of fully specified, hence com- 
plex, representations than from a set of more economical representations. 


2. A second argument for the assumption of two levels of representation is that 
with a single level it would not be possible to express the phonological related- 
ness of morpheme alternants. We take the English plural suffix as an example. 
Suppose that instead of saying that there is a morpheme [z] which in different 
phonological contexts is adjusted in order to obey the phonological con- 
straints of English, we were to say that three allomorphs are listed in the lex- 
icon, [z], [s] and [1z], each of which is used in a specific phonological 
environment: [1z] after sibilants, [s] after (other) voiceless segments and [z] 
in other cases. This description of the regular plural formation would be cor- 
rect, in that for every noun we can predict the plural form. What the descrip- 
tion fails to express, however, is that, somehow, the three alternants [z], [s] and 
[1z] are the same morpheme. It is not adequate to say that this identity is 
expressed in the semantics. Morphemes that mean the same thing can have 
different phonological forms. An example is provided by the English compar- 
ative, which is expressed by the suffix -er in some adjectives, like nice, but by 
means of the ‘periphrastic more in the case of others, like esthetic. In the case 
of the plural suffix, the availability of two levels of representation makes it 
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possible to state that underlyingly the phonological form is [z], even though in 
surface representations it shows up as [z], [s] and [1z] (Anderson 1974). 
Phonologically motivated morpheme alternation can thus be characterized as 
variation in the shape of the same underlying form in a way that differentiates 
such variation from cases in which the different phonological forms are unre- 
lated, like more and -er, as well as from irregular forms, like went, the past- 
tense form of go, for example, or feet, the plural form of foot. In such cases, the 
allomorphs must be listed in the lexicon, since there is no plausible phonolog- 
ical generalization which could account for them. 

While the case for a single underlying form for the English plural is intu- 
itively very clear, there has been no answer to the general question when dif- 
ferent forms should be related to a single underlying form and when they 
must be listed as separate word forms. Chomsky and Halle (1968) derived 
words like sane and sanity from the same underlying form [s&n], just as pro- 
found and profundity are derived from a common [profiind] (where the over- 
bar indicates tenseness). Others have questioned whether this is realistic, and 
have attempted to develop experimental procedures to answer such questions 
empirically (McCawley 1986, Ohala 1986, Wang and Derwing 1986). 


Q27 Which of the following pairs of English words would you say contains a common 
underlying form? 


sew — sewage cork — corkage 
blow- blew conceive — conception 
talk — talked fraternal — brother 


3. The third reason for the postulation of an underlying representation is that 
many generalizations are only valid at a level other than the surface level. 
Kenstowicz (1994a: 72), citing Mohanan (1992), gives as an example the gener- 
alization that sequences of sibilants are broken up by a vowel in English, which 
is responsible for the fact that the plural of English bus is ['basiz], rather than 
*[basz] (or *[bass]). The generalization also holds in Singapore English: the 
plural of kiss is kiss[az] and that of nose is nos[əz]. In this variety of English, 
there is the further fact that plosives are deleted in the coda after fricatives, so 
that lift, list, task are [hf, lis, ta:s]. When these words are pluralized, they come 
out as [lifs, list, ta:s:], respectively. Apparently, the generalization that adja- 
cent sibilants must be separated by a vowel does not hold at the surface level in 
Singapore English, but is true at a level of representation at which the final plo- 
sive must still be present. Clearly, if we took it to be true at the surface level, the 
generalization would predict that the plural of list, with its surface pronuncia- 
tion [lis:], was *['lisoz]. If instead we assume that, underlyingly, the form [list] 
exists, it quite happily allows [z] to be added, without the need for an inserted 
[ə]. There is in fact independent evidence that the plosives exist at a deeper 
level of representation, because they show up in verbal forms before vowel- 
initial suffixes, as in lif [t]ing, lis[t]ing, tas[k]ing. (The assumption here is that 
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the noun list and the verb list have the same phonological form.) In (2), the sit- 
uation is schematized. The plural morpheme [Z] is attached to the underlying 
forms of kiss and list. The [ə] is inserted between the adjacent sibilants in the 
form for kisses, but not in that for lists, since in the latter form the sibilants are 
separated by [t]. Only after the vowel has been supplied can [t] be deleted. 
From (2), it is clear that the generalization 9-INSERTION is only true for the under- 
lying representation, not for the surface representation. 


(2) UNDERLYING kis-z list-z list-1n 
3-INSERTION kisaz (n.a.) (n.a.) 
t-DELETION (n.a.) 1 (n.a.) 
OTHER RULES (n.a.) Os (n.a.) 
SURFACE kisaz liss listin 


If it were assumed that the underlying form of the verb /ist was [l1s], what incorrect prediction 
would be made about the pronunciation of the present participle form of this verb? 
The past participle of regular Singapore English verbs is formed by suffixing [d] to the 


stem, as in [sard] sighed, [farld] filed. Like the plural suffix [z], it is devoiced after voiceless 
obstruents, as in [w:kt] walked, while [ə] is inserted if the stem ends in [t,d]. If the verb 
list, whose past participle form is flrstad], were underlyingly [lrs], what incorrect prediction 
would be made about its past participle form? Why does the assumption of underlying [lrst] 
make the correct prediction? 


Surface forms that contradict a phonological generalization, like [liss] in the last line 
of (2), are opaque, that is, nontransparent. Taking a rule-based perspective, opacity 
can arise because some rule has failed to apply and the rule’s structural description 
is met in the surface form. This occurs in the case of [liss], in which I-INSERTION appears 
ineffective. Equally, a rule may apply even though the context of the rule is not - no 
longer! - present in the surface form. An example of this type of opacity may occur in 
Japanese, when a vowel is deleted after causing a preceding [t] to be [ts](cf.Q6). When 
the underlying form [twkemono] is pronounced [tskemono], the [w] that triggered 
the affrication of [t] has disappeared between voiceless consonants. Opacity is typi- 
cally dealt with quite adequately in a rule-based description, because rules can be 

- ordered so that segments can be allowed to have active or passive effects before they 
are deleted. In OT, however, opacity is not easily dealt with, not that is, if all constraints 
are by definition valid for output forms. We return to this point in section 6.7. 


4.4 NEUTRALIZATION 


In the case of British English ‘dark I’, the rule we postulated produces a novel segment: 
before the application of the rule, no morpheme contains that segment. However, it is 
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frequently the case that the output of a rule is an already existing segment. This is true, 
for instance, for the rule of pevoicine that devoices the [z] of the English plural suffix to 
[s] after voiceless obstruents, as when [kaet-z] is changed to [kets]. It also holds good 
for FINAL DEVOICING, which exists in German and Dutch, among many other languages. It 
produces segments (voiceless obstruents) in positions where such segments already 
appear. In a Dutch word like [pad] ‘toad’, the final [d] will be devoiced to [t], because 
the consonant occurs in the coda of the syllable. As a result, the singular form is pro- 
nounced [pat], but when the plural suffix [an] is attached, the form is ['padan]. Since 
[t] appears in that position in words like [kat] ‘cat’ (whose plural is ['katan]), the oppo- 
sition between [d] and [t] is neutralized in syllable-final position. A rule like English 
DEVOICING or Dutch FINAL DEVOICING, therefore, is a neutralization rule, while British 
English ]-vELARIZATION is an allophonic rule. We will return to the usefulness (or other- 
wise) of this distinction in section 8.2. 


4.5 CHOOSING THE UNDERLYING FORM 


So far, we have not motivated the choice of the underlying form in a situation where 
there is more than one morpheme alternant. At first sight, we might make the assump- 
tion that any one of the surface alternants could serve as the underlying form from 
which the others are derived. This, however, is far from being the case. In some situ- 
ations, there is only one surface form which could serve as the underlying form. In 
other cases, more than one form could be chosen, but one of these leads to a more eco- 
nomical description than the others. An example of the first situation is provided by 
FINAL DEVOICING, discussed in section 4.4. Recall that the Dutch morpheme for ‘toad’ has 
_ two alternants, [pad] and [pat], occurring in the plural [padan] and the singular 
[pat], respectively. If we assume that the underlying form is indeed [pad], we can 
derive the two alternants with the help of (3). 


(3) FINAL DEvorcinG: Obstruents are voiceless in coda position. 


Notice that this rule also applies in the case of [kat] ‘cat’, which has a plural form 
[‘katan]. It just so happens that the form already incorporates the voiceless segment 
the rule is designed to produce. Such application-without-effect is known as vacuous 
rule application. The derivation is given in (4). 


(4) UNDERLYING REPRESENTATION pad pa.d-ən kat ka.t-an 
FINAL DEVOICING t na. vac. na. 
OUTPUT pat pa.dən kat ka.tan 


But why do it this way round? Why don't we assume that there is a voicing rule that 
voices obstruents in the plural? Let’s assume - for the sake of argument - that the 
underlying form of ‘toad’ is [pat]. Instead of a rule devoicing obstruents in the coda, 
we would need a rule that voices obstruents in the onset, i.e. (5). A situation would 
arise which is shown in (6). Clearly, this description produces the wrong results in the 
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case of the plural of ‘cat’. Rule (5) is not a correct generalization about Dutch, and the 
description in (4) must be considered superior. 


(5) onser voicing: Obstruents in the onset are voiced. 


(6) UNDERLYING REPRESENTATION pat pa.t-an kat ka.t-an 
ONSET VOICING na d na d 
OUTPUT pat  pa.dən kat *ka.dən 


In other cases, more than one solution is available that produces the correct forms, 
but one of them is more economical than any of the others. The underlying form of the 
English plural suffix could be assumed to be [1z], instead of [z]. This choice forces us 
to write a rule of 1-deletion, deleting the vowel of the suffix in all contexts except before 
sibilants. The least one could say here is that it is uneconomical to first assume the 
presence of [1] in all plural forms, and then make it disappear again in almost all con- 
texts. More importantly, the contexts in which [1] would have to be deleted form an 
odd collection: vowels, sonorants, all fricatives except [s,z,f,3] and all plosives. The 
consequence of the choice of [1z] as the underlying form, therefore, is that we must put 
up with a rule containing a very clumsy description of the context in which it applies. 
We will learn in Chapter 5 that such an unnatural context is very difficult to formulate 
in a formal notation. 

The third possible choice, [s], is considerably more reasonable. Here too, however, 
an unnecessarily large group of contexts would have to trigger a phonological change. 
Instead of a devoicing rule making [z] voiceless after [p,t,k,f,8], we would need a rule 
voicing [s] after all vowels, all sonorants, as well as after [b,d,g,v,z,0]. The latter rule 
is less elegant than the devoicing rule, which applies to a smaller and more coherent 
group of segments. 


4.6 CONCLUSION 


The recognition of two levels of representation, a surface representation (SR) and a 
more abstract underlying representation (UR), is the cornerstone of phonological the- 
ory. It makes it possible to describe morpheme alternants as variants of the same mor- 
pheme, and opens the way to a description in which the differences in phonological 
form between the alternants are expressed in terms of general statements about con- 
textually defined phonological adjustments. URs and SRs usually differ from each 
other in that URs are more detailed than SRs, but SRs may also obliterate distinctions 
that exist in URs, and thus neutralize contrasts. There is no easy algorithm which, 
given a range of surface morpheme alternants, will lead to the ‘correct’ UR. In theory, 
URs are chosen so that the resultant grammar is the simplest that can be constructed. 
Neither can the question of whether two forms are to be regarded as morpheme alter- 
nants that have a common UR or as forms that are listed separately in the lexicon be 
answered by referring to some simple criterion. In practice, these questions will not 
often vex the phonologist, since they do not seem too difficult to answer in the majority 
of cases. It may be expected that psycholinguistic research will provide new insights 
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here, in particular where the second question is concerned. The next chapter turns to 
another important aspect of phonological structure: it will identify and motivate the 
the smallest unit of analysis, the phonological feature. 


Q29 The Balantak nouns in the first column of the first data set have the derived forms in 
the second column when prefixed with an affix meaning ‘one’. The second data set 

shows verbs stems and their derivations with a prefix meaning ‘unintentionally’ 

(Busenitz and Busenitz 1991; some of the data are inferred). 


Noun stem Prefixed form Gloss for stem 
wuras sawuras ‘seed’ 

bitu?on sambitu?on ‘month’ 

loloon saloloon ‘thousand’ 
tar? santa:? ‘word’ 

koen sankoen ‘head of grain’ 
utok sanutok ‘brain’ 

sumpir sansumpir ‘beard’ 

apu Sanapu ‘fire’ 

noa sanoa ‘breath’ 
gampal sangampal ‘underlayer’ 
noir sano:r ‘nose’ 
malom samalom ‘night’ 

roon ` saroon ‘banana leaf’ 


Verb stem Prefixed form Gloss for stem 
giok tongiok ‘move’ 

pi:le? tompi:le? ‘see’ 

jon tojo:n ‘shake’ 

kana tonkana ‘hit’ 

wawau towawau ‘do’ 

tobok tontobok ‘stab’ 


lua? tolua? ‘vomit’ 
sosop tonsosop ‘suck’ 
noap tonoap ‘yawn’ 
dawo? tondawo? ‘fall’ 
balo tombalo ‘throw’ 
tunu tontunu ‘burn’ 
ronor toronor ‘hear’ 
us tonuis ‘chew’ 


1. List the alternants of each prefix. 

2. For each alternant, list the initial segment of each base before which the alternant occurs. 
3. For each prefix, decide which alternant is the UR. You must choose the URs so that two 
rules suffice to derive the alternants. 


State (in prose) the two rules that are needed to derive the surface representations. 


Distinctive features 


5.1 INTRODUCTION 


In Chapter 2 we saw that the segment inventories of languages can be divided into 
subgroups. Thus, we have separated the group of vowels from the group of conson- 
ants, and when we discussed the way languages ‘build up’ their inventories, we dis- 
tinguished groups of voiceless obstruents (e.g. [p,t,k]) from voiced groups that are 
otherwise the same (i.e. [b,d,g] in our example). This would appear to suggest that the 
segment is not the smallest constituent of phonological structure. In this chapter, we 
will motivate this assumption. We will introduce the distinctive features that will in 
principle enable us to describe the segments in the world’s languages, and to refer 
to those groups of segments that play a role in their characteristic phonological 
processes and constraints. The latter consideration will be shown to provide an 
important motivation for the assumption of distinctive features. In this perspective, 
these features are the elements by which we can refer to natural segment classes, 
groups of segments that are treated as groups by languages. 

We will see that some features are either present or absent in the representation of 
segments, like LABIAL (univalent features), while others may be present with a minus 
value or a plus value, like [+ voice] (binary features). Also, we will argue that not even 
all binary features need to be present in the representation of all segments. Segments 
may either be unspecifiable for some feature, in which case the feature is irrelevant for 
the segment concerned, or be underspecified for some feature(s), in which case the 
segment will only be provided with those feature(s) in the surface representation. 


5.2 MOTIVATING DISTINCTIVE FEATURES 


As we have seen in the previous chapter, it is possible to see inventories of segments as 
composed of intersecting sets of segments, like ‘all the voiceless segments’ or ‘all the 
segments articulated with lip rounding’ or ‘all the segments with a coronal place of 
articulation’. This fact does not in itself imply that a segment is represented in the 
synchronic structure in terms of a number of separate features, rather than as an 
unanalysable constituent. It could be the case that we are simply dealing with a reflex 
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of the way segment inventories developed historically. Many cathedrals in Europe con- 
tain elements of earlier buildings, but this does not mean that these elements are in 
any sense functional today. 

The chief motivation for the introduction, and hence definition, of a feature is that 
it enables us to characterize a natural segment class. It appears to be the case that 
languages frequently refer to particular groups of segments, while other conceivable 
groupings are never referred to. Thus, languages frequently ban voiced obstruents 
from the final position in the word or from the syllable coda, but no language would 
ban [m,d,3] from the coda, while allowing [b,n,y]. This forces us to recognize that the 
segments in the ‘real’ groups have something in common. If particular groups of seg- 
ments figure again and again in phonological generalizations about syllable structure 
constraints or contextual variation, then evidently those groups must share some fea- 
ture by which phonological grammars can recognize them. 


Which consonants are aspirated syllable-initially in English? it 
If we ignore the ordinal suffix [9], as occurring in sixth, which consonants can appear 


word-finally after [s] in English? 

Which consonants can occur between [s] and [r] in English words? 

What is the significance of the fact that the preceding three questions have the same 
answer? 


In addition to the requirement that distinctive features should enable us to refer to 
natural segment classes, we should require of a feature analysis that the distinctive 
features can characterize the segment inventories of the languages of the world. That 
is, all segments must be characterizable in terms of some unique combination of fea- 
tures. This requirement is quite self-evident: we don't want to end up with a list of features 
which cannot characterize the difference between [m] and [n], say. 

There is a third requirement placed on distinctive features. Consider first that there is 
no a priori reason to suppose that natural segment classes should consist of phonetically 
similar segments. Instead of [p,t,k], it might have been the case that it was [p,tf] which 
were aspirated in English. If more and more natural segment classes were found to be 
phonetically arbitrary, we might be led to believe that a distinctive feature was an 
abstraction, i.e. that it could be given no definition other than in terms of the collection 
of segments that had it. To continue the hypothetical example, we would say that [p,tf] 
are [+delta], and that [+delta] consonants are aspirated in some context. In reality, we 
see time and again that the segments in natural segment classes are phonetically 
similar. This has led to the requirement of the NArURALNESS Conpition (Postal 1968: 
73), according to which distinctive features must have a phonetic (articulatory or 
acoustic) definition. Notice that the relationship is not the other way around. We need 
not ~ should not - postulate a distinctive feature purely because there is some phonetic 
property that a group of segments has in common. As Kaye (1989: 27n.) puts it: ‘One 
could group sounds according to the total energy involved in their production, the 
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number of different muscles involved in their articulation, their length in milliseconds, 
the distance an articulator moves from some predefined neutral position, and so on. 
Obviously, none of those definitions corresponds to a natural segment class. 

The three requirements we must impose on a distinctive feature system, there- 
fore, are that 


1. They should be capable of characterizing natural segment classes. 


2. They should be capable of describing all segmental contrasts in the world’s 
languages. 


3. They should be definable in phonetic terms. 


5.3 FEATURE VALUES 


Features may or may not have values. A binary feature either has the value‘+’ or the 
value ‘—’. The claim here is that both the group of segments that has the minus value 
and the group that has the plus value form natural classes. For instance, the assump- 
tion of the binary feature [+ voice] implies that languages refer to groups of voiceless 
segments as well as to groups of voiced segments. Ever since they were first proposed 
in Jakobson et al. (1952), distinctive features have standardly been assumed to be 
binary. In recent years, phonologists have proposed univalent features (e.g. Ewen 
1995). In this case, reference can only be made to the class of segments that has the 
feature, not to the collection of segments that does not possess it. Other terms for uni- 
valent are unary, single-valued or privative. For instance, the feature [LABIAL] allows 
reference to the group of labial segments, but we cannot express any generalization 
involving all non-labial segments. When assuming a univalent feature like [Lasiat], 
therefore, a phonologist makes the claim that no language ever refers to the class of 
non-labial segments. (Multivalued features are no longer used. One that was common 
in the heyday of SPE was the feature [n stress], where n was a number to indicate the 
degree of stress of the vowel that was specified with the feature.) There is no reason 
why all features should be of the same type. In the next section we will introduce a set 
of distinctive features, some of which are binary and some univalent. 


5.4 A SET OF DISTINCTIVE FEATURES 


The following set of features represents a modified version of the set introduced 
by SPE. It largely follows Halle and Clements (1983), but has also been informed 
by Sagey (1986). The features come in the following groups: major-class features, 
which classify segments into segment types like ‘vowel’ and ‘obstruent’; laryngeal 
features, which specify the glottal properties of the segment, manner features, 
which specify the type of constriction, or more generally the manner of articulation; 
and place features, which encode the place of articulation. 
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5.4.1 Major-class features 


There are three major-class features, [+consonantal], [+sonorant] and 
[+ approximant]. 


1. [+consonantal]. [+cons] segments have a constriction somewhere along 
the centre line in the vocal tract which is at least as narrow as that required for 
a fricative; [一 cons] segments lack such a constriction. Thus, [十 cons] are 
plosives, affricates, fricatives, nasals, laterals and [r], while [一 cons] are vowels, 
glides like [v,w,j], and - because their stricture is in the larynx rather than 
the vocal tract — [h,fi,?]. 


2. [+sonorant]. This feature distinguishes obstruents ([ 一 son]) from sonorant 
consonants and vowels ([+son]). [+ son] segments are produced with a con- 
striction in the vocal tract which allows the air pressure behind it and in front 
of it to be relatively equal, while this is not the case for [一 Son] segments. That 
is, either [— son] segments have an oral constriction which causes a significant 
increase in the air pressure behind it (e.g. [s], [d]), or there is no constriction 
in the vocal tract. Since the vocal tract does not include the larynx, [h] 
and [?] are [—son]. So [十 son] are all vowels, glides like [v,w,j], liquids 
and nasals, while [—son] are plosives, fricatives, affricates and laryngeal 
segments. ! 


How many (possibly overlapping) natural segment classes can be referred to with two 
binary features at one’s disposal? Hint: A natural class can be captured by one feature or 
by a combination of features. 

Use [+cons] and [+son] to characterize four classes, giving examples of segments for 
each class. 


Q32 In Dutch, there is a rule that places a [ə] between the noun stem and the diminutive 


ending [tja], as when [bal] ‘ball’ is affixed with [tja] and becomes [‘balatja]. On the 
basis of the following data, characterize the group of segments after which this [a] 
is inserted. 


balatja ‘ball’ dinki toj dinki tjtja ‘Dinky Toy’ 
komatja ‘bowl’ kap kapja ‘hood’ 
laxja ‘laugh’ bes befa ‘berry’ 
kanatja ‘jug’ rok rokja ‘skirt? 
karatja din dinatja ‘thing’ 
petja ‘cap’ sək . sokja ‘sock’ 


! The characterization of [h, ?] as [一 son] is controversial. It is sometimes assumed that they are [+son], as in 
SPE, Trommelen and Zonneveld (1983) and Halle and Clements (1983). 
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in the variety of Spanish as spoken in the state of Cordoba in Colombia there has 
been a process which assimilates the first of two adjacent consonants to the 
second, creating a geminate consonant (Charette 1989). For example, the word for 
‘door’, which is [pwerta] in Peninsular Spanish, is [pwetta] in the Cordoba variety. 
The process did not always apply. Study the following data and characterize the 
class of consonants that underwent the process. 


Later form 
seddo 
awto 


Earlier form 
serdo 

awto 

talko takko 
doktor dottor 

algo aggo 
neptuno nettuno 
fohforo fohforo 
magdalena maddalena 
ojgo ojgo 

arma amma 
ahno ahno 


Gloss 
‘pork’ 

‘car’ 

‘talc’ 
‘doctor’ 
‘something’ 
‘Neptune’ 
‘match’ 
‘Madeleine’ 
‘onion’ 
‘weapon’ 
‘donkey’ 


In Dutch, sequences of identical consonants are degeminated. These sequences 
will arise when the last consonant of one morpheme is the same as the first 
consonant of the next. However, not all such sequences are degeminated. 
Characterize the class of sounds that is subject to the rule in terms of distinctive 
features. 


Input Output 


xel lampja 
fis sapja 
le:xxa:tjə 
fra:j jaxt 
lø:k kint 
dəm mens 
ryv veir 
do:f fentja 
Xutko:p pak 
fein na:xaltja 
niv vontja 
kar radeifas 


xe: lampjə 
fi sapja 

le: xa:tja 
fra:] jaxt 
lø: kint 

dd mens 
ryv verr 
do: fentja 
Xutko: pak 
fei na:xaltja 
niv vontja 
ka radeifas 


Gloss 

‘yellow lamp’ 
‘nasty drink’ 
‘empty hole’ 
‘beautiful yacht’ 
‘nice child’ 
‘unwise woman’ 
‘stormy weather’ 
‘deaf fellow’ 
‘cheap suit’ 

‘fine nail’ 

‘new wound’ 
‘cartload of radishes’ 


*fra: jaxt 


*ry veir 


*ni vontja 
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In Dutch, certain consonants must be voiceless at the end of a syllable. Characterize 
that class of consonants with the help of distinctive features. 


PL Gloss SG PL Gloss 
liran ‘pulley’ ka:rt ka:rtan ‘card’ 
mepan ‘slap’ rant randan ‘edge’ 


speldan ‘pin’ bal balan ‘ball’ 
kantan ‘side’ 3i'raf 3i'rafan ‘giraffe’ 
reizan ‘journey’ vep veban ‘web’ 
doeyvan ‘pigeon’ pairt pa:rdan ‘horse’ 
vanan ‘cheek’ sxun sxunan ‘shoe’ 
kausan ‘sock’ le:v le:van ‘tion’ 


3. [approximant]. [+approx] are those segments which have a constriction 
in the vocal tract which allows a free (frictionless) escape of air, while for 
[一 approx] segments this is not the case (Ladefoged 1971: 46, Clements 1989). 
Vowels and non-nasal sonorants, like [I, 1, A], are [+ approx] segments. (The 
term ‘lateral’ is used for any l-type sounds, while the term ‘rhotic’ refers to any 
r-type sound; laterals and rhotics are often referred to as ‘liquids’.) 


Copy the following hypothetical segment inventory and draw lines that separate 
(a) [一 Son] from [+ son], (b) [—approx] from [+approx] and (c) [+ cons] from 
[-cons]. 


Q37 In English, certain segments may appear after [sp, st, sk] in the same syllable. 
Characterize them with the help of a single distinctive feature, on the basis of the 


following data. 
skip sprer stju: ə'sparə 
stop skju:d skla’rausts skræp 
skwod sprein spju: strert 


staek stro: ‘splendid skal 
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The difference between vowels and glides like [j,w] is commonly assumed to be a 
matter of phonological structure, rather than of phonological content. That is, [u] is 
different from a labialvelar approximant [w] not because these two segments have dif- 
ferent features, but because they occupy different positions in the syllable: the peak of 
the syllable for [u], and the margin, usually the onset, for [w]. Chomsky and Halle 
(1968) used the feature [+ syllabic] to express this difference. In later work syllabicity 
of segments is expressed directly in a representation that includes syllable structure. 
We will use the feature informally to refer to the difference between ‘syllabic’ (i.e. occu- 
pying the syllable peak) and ‘non-syllabic’ segments. 


5.4.2 Laryngeal features 


There are three laryngeal features, [+ voice], [+spread glottis] and [+ constricted 
glottis]. 


1. [voice]. [+voice] are segments for which the vocal cords are close enough 
together to allow vibration, while for [一 voice] this is not the case. Thus, 
[+voice] are vowels (e.g. [i,a,č,a1]), sonorant consonants (e.g. [m,n, 
l,r,R,W], voiced obstruents (e.g. [b,z,y,d3] and [fi]), while [— voice] are voice- 
less obstruents [(e.g. [p,9,f,ts,h]). 


2. [spread glottis]. | +spread] segments have a vocal cord configuration that 
produces audible friction in the glottis, while [— spread] segments lack such 
a configuration. Thus, aspirated segments like [phkwh] and [h,fi] are 
[+spread], while other segments are [—spread]. 


3. [constricted glottis]. For [+ constr] segments the vocal cords are tense and 
drawn together, while for [—constr] segments this is not the case. Thus |?], 
laryngealized vowels (e.g. [u]), and laryngealized sonorant consonants (e.g. 
[m?]), glottalized obstruents (e.g. preglottalized [°p] or ejective [p’]) are 
[+ constr]. So are implosives ([6,d,g]). Other segments are [一 constr]. 


American English [p,t,k] are accompanied by a glottal closure when appearing in 
the syllable coda, as in sit, atlas, popcorn, duckpond. What feature specification do 
these plosives acquire in this context? 


In Southern Oromo, a rule of i-EPENTHESIS inserts [i] between the ejectives [t’, tf] and 
a following [t,n], as shown in (1a). However, [?], [d ] and ft] do not trigger the rule, but 


undergo other changes that are not relevant here, as shown in (1b,c,d) (Lloret 1995). 
What combination of features distinguishes the [t’] from the oe consonants that 
do not trigger i-EPENTHESIS? 


(1) a. fit’-na fit’ina ‘we finish’ 
b. a?-na a:na ‘we push’ 
c.  ferd-ta fe:t:a ‘you want’ 
d. bit-ta bit:a ‘you buy’ 
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5.4.3 Manner features 


There are four manner features, [ + continuant], [nasal], [+ strident] and [+lateral]. 


L 


[+continuant]. [+cont] segments lack a central occlusion in the vocal tract, 
while [一 cont] segments are produced with such an occlusion. Thus, plosives 
(e.g. [p,d,g]), nasal consonants (e.g. [m,n]), affricates (e.g. [tf]) and laterals 
(e.g. [I]) are [—cont], other segments are [+ cont]. Some languages apparently 
treat laterals as [+ cont], which is phonetically understandable in the sense 
that while these segments have a central occlusion, they have a lateral aperture. 


[nasal]. [十 nas] segments (e.g. [m,n,y]) are produced with the velum (‘soft 
palate’) lowered, [ — nas} segments have the velum in its closed (raised) position. 
Nasal consonants and nasalized vowels are [+ nas], other segments are [一 nas]. 


[+strident]. [+strident] is relevant for obstruents only, and refers to a type 
of friction. [+strident] segments cause a noisier kind of friction than 
[一 strident] segments. [十 strident] voiceless fricatives are [f,s,J,x], [—stri- 
dent] ones are [,0,¢,x]. Together with [coronaL] (section 5.5.2), the feature 
can be used to capture sibilants ([s,z,J,3,t[,d3]), needed for a correct descrip- 
tion of the context for English I-INsERTION. Languages for which a contrast 
between [f,v] and [中 ,B] has been reported include Ewe and Venda (Ladefoged 
and Maddieson 1996: 140). The name Ewe forms a minimal pair with the 
word for ‘two’, [èvè], in that language, while English contrasts [s,z] with [0,0], as 
in sigh, xi (the Greek letter &, [zat]), thigh, thy. The feature’s other task is to dis- 
tinguish plosives from affricates, both of which are [一 son, —cont]. Such con- 
trasts are common, as in German [tol] ‘mad’ vs [tsol], English [tarm] time vs 
[tfaim] chime, or Corsican ['jalu] ‘freeze + 1sc’ vs [‘d3alu] ‘yellow’. The repre- 
sentation of affricates is controversial. A widely supported view, however, is 
that they are [+strident] plosives (Rubach 1984, Clements 1999). For one 
thing, it is at least suggestive that affricates typically have strident friction after 
the release of the closure, as in [pf,ts,t{,ky] rather than [pg,t0, cc,kx]. 


In Scottish English, [i, e, a, 0, u, A, Ai] are pronounced as long [i:, e:, a:, ov us, A ae] 
in open syllables. The tong vowels (including [ae]) also appear before certain 
consonants. Characterize the class of consonants before which the long vowels 
appear. This regularity, described in Aitken (1981, 1984), has become known as 
Aitken’s Law. 


writhe mail mile 
nine be:3 beige 


tease rod road 
love ka:r car 
Leith hom home 
tea laif life 
peace fud food 
mail raf rash 
fire boar 
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Q41 In Turkish, the [voice] contrast in obstruents is neutralized in the syllable coda in 
favour of the voiceless member, as shown in (1) (after Kim 1997). 


(1) UR Objective Plural 
ip ipi ipler ‘rope’ 
dib dibi dipler ‘bottom’ 
at atw atlar ‘horse’ 
ad adw atlar ‘name’ 
køk køki køkler ‘root’ 


999 gøgi gøkler ‘sky’ 
dif difi difler ‘tooth’ 
kuz kwzw kwzlar ‘daughter’ 


deniz denizi denizler ‘sea 
satf satfw satflar ‘hair’ 
a:d3 a:d3w a:tflar ‘tree’ 
Which obstruents are devoiced? 


Can the group of obstruents that is subject to the neutralization be characterized as a 
natural class? 


4. [+lateral]. [十 lat] segments have a central tongue contact in. the oral cavity 
with one or both sides of the tongue being held away from the roof of the 
mouth, allowing the air to escape there, like alveolar [l] and prepalatal [A]. 
Other sounds are [一 lat].A lateral escape of the air is also possible for obstru- 
ents, like the lateral fricatives [4] (voiceless) and [k] (voiced) and the lateral 
affricates [tł] and [db]. 


5.4.4 Ambiguity and nonspecification 


In this section we further illustrate the kind of reasoning by which featural analyses of 
segments are arrived at. As you will recall from section 5.2, the first and foremost 
motivation for the definition of a feature is that it allows natural classes to be referred 
to. The aim of the exercises in this chapter so far has been to show this. It is possible, 
of course, that there is conflicting evidence for the inclusion or exclusion of a particu- 
lar segment in a natural class. This is the case with [1], which behaves ambiguously 
vis-a-vis the feature [cont]. We will conclude that languages may differ in the 
feature value of [continuant] for [1]. Second, it is to be expected that certain (classes 
of) segments are not specified for all features. That is, features may be irrelevant 
for certain segments. Thus, vowels will not be specified for [lat], simply because 
those segments cannot be differentiated with the help of this feature. Less obviously, 
it appears that [h] does not participate in rules referring to [+ cont]. In the section 


below, we will argue that laryngeal segments are not specified for either manner or 
place features. 
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The ambiguous behaviour of [I] 


If you did Q40 right, you will have drawn the conclusion that in Scottish English, [1] 
is [—continuant]. In other languages, however, the same segment may have to be 
analysed as [+continuant]. In Frisian, vowels are nasalized before [n] in the same 
syllable, provided a [+cont] consonant follows. The [n] itself is subsequently lost 
(Tiersma 1985). From the data in (1), which consist of infinitival verb forms prefixed 
with [in-] or [oan-], it is clear that the [1] must be [+ cont]: the group of consonants 
that does not allow the change to go through are [p,t,k,g,n], while the consonants that 
do allow it are {s,f,j,v,r,]]. 


(1) in-pake impaka ‘to wrap up’ 
oan-treka oantreko ‘to take to heart 
oən-komə oənkomə ‘to arrive 
ingran ingion ‘to enter 
oon-nima oannime ‘to accept 
oan-stion oastian ‘to please’ 
in-fola ifola ‘to fall in’ 
in-jain ijain ‘to give in 
in-véja Ivéjo ‘to live with one’s parents’ 
oan-ropa oõropə ‘to call’ 
in-liza iliza ‘to preserve’ 


The feature specification of laryngeals 


The Frisian rule of NASALIZATION may also give us an indication about another ques- 
tion that concerns the feature [continuant]. The laryngeal consonants are [—cons, 
—son]. If we take the definitions of these features seriously, this means that [h,?] have 
a constriction in the larynx, but have no constriction in the vocal tract, and thus do not 
have the sort of constriction that sonorants like [j] have. That is, they do not have a 
constriction in the vocal tract that is at least as narrow as used by fricatives. The ques- 
tion therefore arises whether it makes sense to want to specify [h,?] for manner fea- 
tures or for place features, since after all, if there is no stricture, how could we specify 
either its manner or place? While this makes good phonetic sense, the first question is 
whether languages make reference to any manner or place features of glottal conso- 
nants. For example, we might find rules that refer to sets of the type ‘[h] and other 
(classes of) [+continuant] segments’. One such set is ‘all vowels, the approximants 
(1,1,j,w] and all fricatives, including [h]’, which would be the class [十 cont]; another is 
‘[Lrj,w] and all fricatives, including [h]’, which would be the class [十 cont, —syl]. As 
we have just seen, Frisian NASALIZATION is a rule referring to the class continuant conso- 
nants, but, interestingly, [h] is not among them, as shown in (2). If [h] were specified 
for manner features, that is, if [h] were [十 continuant], we would expect the rule of 
nasalization to apply before it. Since nasalization does not apply before [h], we have to 


conclude that this segment is not specified for manner features and hence cannot be 
considered [+ continuant]. 
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(2)  inhelja ‘to hold in 
oanhiara ‘to listen to 


A Dutch assimilation rule points to the same conclusion. PROGRESSIVE DEVOICING devoices 
fricatives after obstruents, as shown in (3). 


(3) ze: ‘seg op se: ‘at sea’ 
3ur'nail journal’ dit furna:] ‘this journal’ 
vyir ‘fire’ ‘kampfy:r ‘campfire’ 
yas ‘gas dit xas ‘this gas’ 


The class of voiced fricatives could be referred to by [—son, +cont]. Clearly, if [h] is 
[+ cont], this consonant would be included in that natural class. As it happens, the 
Dutch glottal fricative is frequently voiced. This voiced [fi] shows up after obstruents, 
too: stadhuis ‘town hall’ may be pronounced [stat'ficeys] as well as [stat haoeys] 
(Rietveld and Loman 1985). This means that Dutch [fi] is not subject to PROGRESSIVE 
DEVOICING, and that it therefore cannot be [+ cont]. We now have evidence from two 
languages that [h] is not specified for continuancy. With McCarthy (1988) we will 
assume therefore that laryngeal segments are not specified either for manner or for 
place features. 


5.9 PLACE FEATURES 


In the previous section, a set of binary features was introduced which are capable of 
specifying the major class, the state of the glottis and the manner of articulation of con- 
sonants. In this section we deal with the features used to specify the place of articulation 
of consonants and the tongue position of vowels. Among the place features, there are 
four univalent features specifying the major areas of articulation. These are the features 
[LABIAL], [coronar], [DORSAL] and [RADICAL]: a segment either has the feature or it does not. 
This implies that, just as laryngeal segments are not specified for a number of features 
(those specifying either manner or place of articulation), not all segments will be speci- 
fied for all the place features: a consonant which is not coronal will not have the feature 
[corona]. Binary place features will be used to characterize the place distinction within 
a major articulator area. The feature [+ round], for example, will be used to specify seg- 
ments that are articulated with the help of the lips, i.e. are [LABIAL]. Segments whose 
articulation does not involve any activity of the lips will thus be neither [十 round] nor 
[— round]: they have no specification for that feature. 


5.5.1 Labial 


[LABIAL] segments are articulated with the lips, like [f,p,m], or - in the case of vowels — 


are formed with lip rounding, like [y,o,ce]. Segments that are [LABIAL] may be specified 
for [+ round]. 


1. [+round].[+round] segments have lip rounding, like [pw,tw,o,u5]; [一 round] 
segments do not. In rare cases, unrounded and rounded labial segments contrast, 
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as in Margi, Nambakaengo and Kilivila (Senft 1986). Labialized segments like 
[t"] will be discussed further in Chapter 12. 


5.5.2 Coronal 


[corona] segments are articulated with a raised crown of the tongue, i.e. a raised tip 
and/or blade, ranging from a dental [6] to a prepalatal [j]. Examples of [coronat] seg- 
ments are [t,z,1,6,j,J,n.r]. [coRoNAL] segments are further specified for the features 
[anterior] and [+distributed], and in the case of coronal fricatives and affricates 
also for [+strident]. 


[+ anterior]. For [+ ant] segments, the crown articulates with the alveolar ridge 
or somewhere further forward, while for [-ant] segments, the crown articulates 
with a point behind the alveolar ridge. Thus, [t,s,0] are [+ ant], while prepalatal 
or postalveolar and retroflex consonants (e.g. [f,3,,¢,j] and [t,d,3,n,|]) are 
[~ant]. 


2. [+distributed]. Segments that are [+distr] are produced with a constriction 
that extends for a relatively great distance along the vocal tract, while for [—distr] 
segments this is not the case. Thus, consonants produced with the tip of the 
tongue (apical consonants like British English [t,d,n]) are [— distr], as are [s,z] 
(Clements 1985). Blade-articulated (laminal) consonants like [f,tf,3]) are 
[+distr]. Dental consonants like [0,ð,1,t] are also [+ distr], because even where 
itis only the tip that touches the front teeth, the blade is close to the alveolar ridge 
and in fact contributes to the acoustic effect. Retroflex consonants are [ — distr]: 
the tip articulates with the part of the palate immediately behind the alveolar 
ridge. Australian languages frequently have a four-way opposition, utilizing the 
four possibilities given by these features. Four coronal stops and nasals contrast 
in Kayardild, for instance, as shown in (4) (Evans 1995).” 


(Lamino-) (Apico-) (Lamino-) Retroflex 
dental alveolar prepalatal 
c fl 


5.5.3 Dorsal 
[porsaL] sounds are articulated with bunched dorsum: [k,g,y,p] (velars), as well 


as [¢,k] (fronted velars) and uvulars (e.g. [y,q]). In addition, all vowels are [porsat]. 


? This predicts that no language contrasts dental stops, which are [十 ant +distr], with laminally produced 
alveolar stops, which are also [+ant, + distr], which would seem to be correct. 
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[porsat] segments are further specified for a set of features that specify just where the 
bunch of the tongue body is located, the tongue body features. 


1. [+high]. Segments that are [十 high] raise the dorsum to a position close to the 
roof of the mouth, while [一 high] segments do not. Thus, [ +high] segments are 
[i,1, y, Y, U, W, u], as well as [ç],and [k, g,x, y, 0], while [x,e, 0,a] are [—high], for 
instance. 


2. [+low]. [+low] segments have the bunched dorsum low in the mouth, while 
[—low] segments do not. Thus, [十 low] segments are [a, €:, 9:], for instance. 


3. [back]. [十 back] segments have the bunch of the tongue positioned in the 
centre or further back, while [一 back] segments have the bunch in the front. 
Thus, [十 back] segments are velar and uvular consonants (e.g. [k, y, X]) and 
vowels like [u,3 ,0, A,a], while [一 back] segments are fronted velars like [k] and 
[ç], and vowels like [i,y,g,e]. It is emphasized that [ç], although classed with the 
palatal consonants [c,j,n] in the IPA chart (i.e. with the [coronat] consonants), 
is interpreted as a fronted velar, i.e. a [porsat] consonant. Many languages have 
[ç] and [x] in complementary distribution depending on the backness of the 
preceding or following vowel. For instance, Greek ['¢é ri] ‘hand’ begins with the 
same phoneme as ['xaril] ‘charm (cf. Q26). 


4. [tense]. [+ tense] vowels like [i,e,a,o,u] are produced with a more peripheral 
and somewhat closer tongue position than their [一 tense] counterparts [1,£,0,£, 
u]. The feature is only relevant if the language has vocalic oppositions like [i - 1], 
ly - Y], [u - u], etc. It is commonly used in Germanic languages, which have con- 
trasts like English [su:t] suit— [sut] soot and German ['mi:tə] ‘rental fee ~ ['mita] 
‘middle’. The features [+ Advanced Tongue Root; ATR], used for instance in the 
description of the West African language Akan (Lindau 1978), and [+ Retracted 
Tongue Root; RTR], used for instance in the description of the Tungusic lan- 
guages of Siberia (Li 1996), may be seen as phonetic variants of this phonologi- 
cal feature. [+ ATR] involves a forward position of the tongue body, with 
concomitant enlargement of the pharynx, while [+RTR] involves a retraction 
and lowering of the tongue body, with concomitant narrowing of the pharynx. 
Akan has four plain vowels and five [+ ATR] vowels, three of the former type 
occurring in [—ATR] [ebus] ‘stone’, and three of the latter in [+ ATR] [ebuo] 
‘nest’. Baiyinna Orochen has nine plain and nine [十 RTR] vowels, and [—RTR] 
[olo:] ‘to cook’, for instance, contrasts with [十 RTR] [olo:] ‘to wade’. The features 
[+ tense], [+ ATR] and [+RTR], while phonetically somewhat different, appear 
never to co-occur in the same language (cf. Halle and Clements 1983: 7; 
Ladefoged and Maddieson 1996: 300). 


5.5.4 - Radical 


[RADICAL] (also [PHARYNGEAL]) sounds are articulated with the root of the tongue. A 
voiceless fricative [h] occurs in many varieties of Arabic, as does a pharyngeal approx- 
imant [{]. See Ladefoged and Maddieson (1997) for more information. 
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5.6 SOME EXAMPLES 


On p. 71 we give a table with feature values for a number of representative consonants. 
Notice that many segments have more than one place feature, and are technically ‘com- 
plex’. A round vowel like [u] is [LABIAL, 十 round] as well as [porsat, +back, +high], 
while a labialized [t] is both [coronat] and [LABIAL]. In Chapter 12 we will further dis- 
cuss the representation of complex consonants. 

A seven-vowel system like that of Italian comes out as in (5). It is illustrated in (6). 


[— round] [—round] [+round] 
Oe a a 
C e E a 
Crw e a 


mil:e ‘thousand’ a punio ‘fist 
E Se a CSET 
sempre ‘always’ skala ‘stairs 


As observed above, in many Germanic languages, the vowels divide into a lax set 
and a tense set. Standard German has the vowel system in (7), where the lax vowel is 
given on the left of each cell. Examples are given in (8). The bracketed vowel [æ:] has 
merged with [e:] in the speech of many speakers. Note that [g,o] are the lax counter- 
parts of [e:,0:] in German and Dutch. 


[—round] [+round] | [—round] [+ round] 
igor e [we [| oe 
[igs ot [ee [ew [fa 
ee da 


(6) 


[ig Fer e 


[stem [ove ewer [ [ae we 
Ce lel aM a r 


(8) 
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Some examples 


+ 十 十 
+ 十 tea 
十 十 十 十 

+ 1 

| 


SU0J 


re 
+ + 
+ 
十 
十 
十 
十 
十 
= 
一 
十 
-= 
+ 
= 
一 
4 
+ 
+ 


səZenZue] ssoide SILLA [1] 10} Duenuquoo] aznlesj IYI Jo anleA ayL 'aInleaj IYI 10} paUIads JOU st JULUOSUOI IY} PEYI NPLIIPUI 
SUeIg `A Aq payeoiput sı əmzeay 人 ITPUn e Jo aouasaid ayp ƏryM “— 10 + se payloads are sainqeay Areulg “sainye3y 9] 10} SIUeuosUoOD aAHP}UasaIdaI çz Jo sUOTRIYIDads ainjerg 
上 9 algel 
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: : T 
diinst ‘service | myido ‘tired’ n guit ‘good 
meir ‘more Jorn ‘nice eas! roit ‘ 


There are languages that contrast four vowel heights without employing a tense—lax 
contrast, of which Danish and Imonda are examples. The tongue-height features 
[thigh] and [+low] cannot characterize such systems. Of the four theoretically pos- 
sible combinations, the specification *[+high, +low] must be ruled out, because it is 
contradictory: the body of the tongue cannot simultaneously be raised and lowered. 
The Imonda vowel system is given in (9) (Seiler 1985). Clements and Hume (1995) 
present a feature framework that can account for such four-height systems (as well as 
for five-height systems, which have also been reported to exist). 


(bæir) 


5.7 REDUNDANT VS CONTRASTIVE FEATURES 


Frequently, feature specifications are predictable. This predictability is in part a con- 
sequence of the incompatibility of particular feature specifications. For instance, as 
we have seen in the previous section, the presence of | + low] in the specification of a 
vowel predicts that it will be [—high]: the tongue cannot be raised and lowered at the 
same time. The same goes for [+high]: it predicts [— low]. In large measure, however, 
the predictability is language-specific, and results from the fact that not all languages 
use the phonological possibilities to the full. Take the Turkish vowel system, for 
instance. This language employs all the combinations of [back] and [+round], but 
only has two distinctive vowel heights, as shown in (10), with examples in (11). 


[—back] [一 back] [+back] 
[— round] [+ round] [+round] 


[+back] 

[— round] 
TT a o a 
TEAREN 


Redundant vs contrastive features 73 


Cu o |o 和 了 2 


In this system the feature [+low] is entirely redundant, as one of its values, [—low], 
does not appear to be used at all. In other inventories, both values of a feature may 
occur, but one value may be predictable from other features. In the Italian vowel sys- 
tem (5), for instance, [+ round] predicts [+back]: all round vowels in this language 
are back. (This absence of front rounded vowels, incidentally, is a characteristic that 
Italian has in common with most languages in the world.) Or again, while in many 
languages obstruents come in pairs contrasting for [+ voice], like [p]-[b], [s]-[z], 
etc., sonorant segments like [m,l,r] are usually always voiced, with the result that 
in those languages [+son] predicts [+ voice]. This means that while in the class of 
obstruents the feature [+ voice] is contrastive (or distinctive), [voice] is redun- 
dant in sonorants: they are predictably [+ voice]. 

A large body of work, often referred to as underspecification theory, has been 
devoted to the question of whether redundant features should be included in the under- 
lying representation of morphemes, and if not, when they should be supplied. Just as in 
the case of the nonspecification of manner and place features in glottal consonants (see 
section 5.4.4), arguments for or against underspecification have been based on the sim- 
plifying effect that the absence of a feature has on the formulation of phonological gen- 
eralizations. We give one example here, from Steriade (1987). If we assume that [I] is 
[十 cont] in Latin, [+lateral] is distinctive in Latin within the class of liquids, [lr], or 
[+ approx, + cons]. In the class of nasals and obstruents ([—approx]) as well as in the 
class of glides and vowels ([—cons]), the feature is redundant. Steriade (1987) proposes 
that only contrastive features are specified in underlying representations. This means 
that [r] is marked as [一 lat] and [l] as [十 lat], but that in no other segment is there a 
specification for [+lat]. This assumption allows for an interesting description of the 
distribution of the alternants -aris and -alis of the adjectival suffix. Consider the forms 
in (12). 


(11) 


(12) a. nav-alis ‘naval’ 
crimin-alis ‘criminal 

b. sol-aris ‘solar 
milit-aris ‘military’ 

c. flor-alis ‘floral’ 


The forms in (12a,b) show that the alternation depends on the presence of the fea- 
ture [十 lat] in the base: if the base contains [十 lat],the form -aris is used, otherwise -alis. 
If we assume the underlying form of the suffix to be [alis], the [十 lat] [1] is seen to 
change into [一 lat] [r] after a base containing [十 lat]. (Such processes are known as 
dissimilations.) But now consider (12c). Although the base contains [+lat], the suffix 
is alis. It is here that the underlying [一 lat] specification of [r] comes in useful. Because 
other nonlateral sounds, like [t] of militaris, are not specified as [—lat], Steriade’s 
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proposal allows us to say that the last specification for [lat] in the base determines the 
specification for [lat] of the first consonant in the suffix. This is shown in (13). 


(13) [+lat] [一 lat] 
| | 
a. mil it-ar is 
[+lat] [—lat] [+lat] 


| | | 
b. fl or-a I is 


This particular conception of underspecification, in which features are specified 
only in segments that contrast for the feature concerned, is known as contrastive 
underspecification. The topic of underspecification is closely related to ones view of 
how segments are represented; it is taken up again in Chapter 11, after we have pre- 
sented a hierarchical representation of the distinctive features introduced in this 
chapter. There, it will appear that it is in fact more useful to include in underlying rep- 
resentations only one of the two feature specifications of contrastive features. 


Q42 in Corsican, some vowels are nasalized before a nasal consonant in the same 
syllable (Agostini 1995). How would you characterize the class of vowels that 
undergo this nasalization? 


‘printfipe 


‘prince’ 


‘anku ‘also’ 
‘ponte ‘bridge’ 
'fungu ‘mushroom’ 
'kõntu ‘account’ 
funda'mentu foundation 


Q43 In Bengali, there is an optional rule which deletes [r] before certain consonants, 
allowing the consonant to geminate. Characterize the class of consonants that 
trigger r-DELETION on the basis of the following data (Hayes and Lahiri 1991). 


Input Output Gloss 

barfa baffa ‘rainy season’ 
mithu-r fari mithu-Jfari ‘Mithu’s sari’ 
ram-er bari ram-er bari ‘Ram's house’ 
ram-er taka ram-et taka ‘Ram’s money’ 
fon-korbo fon-korbo ‘will telephone’ 
fundor dardza Jundod doddza ‘beautiful door’ 
bordi boddi ‘elder sister’ 
brorti brotti ‘full’ 

kor-lo kol-lo ‘do-3rd FUT.’ 
kor-t 0 kot-to ‘do-3rd PAST’ 
matf*-er na:k matf*-en na:k ‘fish’s nose’ 
matfh-er mata matf-er mata ‘fish head’ 


Redundant vs contrastive features 75 


Q44 A number of nouns in the Dutch dialect of Wehl form their plurals by changing the 
last full vowel of the stem, a process known as UMLAUT (Haan 1996). What feature is 
involved in this process? 


Singular Plural 

zk zek ‘sock’ 
ynt yræ:t ‘fishbone’, 
VUS VYS ‘fox’ 

ho:p he:p ‘heap’ 


045 


In Luganda, [r] and [I] occur in complementary distribution (Chesswas 1963). 


olubiri ‘palace enclosure’ akalulu ‘vote’ 

linna ‘climb’ : engiri ‘warthog’ 
ssaffaali ‘safari’ enkula ‘rhinoceros’ 
erang ‘dye’ akasaale ‘arrow’ 
akasolja ‘roof’ olumuli ‘reed’ 
kampala ‘Kampala’ liiri ‘silk’ 
omulere ‘flute’ akabonero ‘sign’ 
omulenzi ‘boy’ weeraba ‘goodbye’ 


luma ‘to hurt’ lje ‘my (poss. Class V)’ 


What determines their distribution? List the contexts in which each allophone occurs. 
Which of these two contexts is statable in terms of distinctive features? 


Which of the segments would you choose as the underlying one? Please motivate your 
answer. 


Q46 Use minimal numbers of distinctive features to characterize the five natural 
segment classes within the segment inventory of Telugu. 


ph th t Oe kh 
bf Pi MSc 90; Mei Gruo Of 
$ 
n 
l 


j 
e O a 


ph bf th ds fh qf kh gf 
thd d fds tt d 
b bf m 
wjieaou 

ao 
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Q47 Use minimal numbers of distinctive features to characterize the five natural 
; segment classes within the segment inventory of Amharic (Hayward and 
Hayward 1992). 


ES COC ROR Re 

d } g g” 

righ ss | rene 
ey 3 

n 了 0 

| 


一 . 


u 
0 
a 


ii 463 253 
CIPE tes: 
co ys3 

np 
uo 


as SS ee 


5.8 CONCLUSION 


Segments can be analysed into collections of univalent and binary features. A suc- 
cessful feature analysis will characterize the segmental contrasts that are used in the 
languages of the world, while it will additionally allow natural feature classes, groups 
of segments that figure in phonological grammars, to be referred to. We have seen that 
segments may be unspecified for certain features, either because the feature is irrele- 
vant for the class of segments concerned, like [strident] in the case of vowels, or 
because the feature is univalent and the segment does not have it, like [Labia] in the 
case of [t]. We have also made a distinction between contrastive and redundant fea- 
tures, where the former are features that, in some class of segments in some language, 
characterize contrasts, like [+ voice] in a language with voiced and voiceless obstru- 
ents, and the latter are not, like [+ voice] in a language that only has voiceless obstru- 
ents and voiced sonorant consonants and vowels. It is often assumed that redundant 
features do not appear in underlying representations. 

In Chapter 11, we will see that segments are not simply lists of features. Because 
features tend to be active in specific groups, such as the place features or the laryngeal 


features, the assumption has generally been made that features are grouped into 
constituents. 
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Ordered rules 


6.1 INTRODUCTION 


Phonological rules are formal expressions that describe changes in the phonological 
representations of words. As a result of the application of a phonological rule, a seg- 
ment may be inserted or deleted, or one or more of its feature values may be changed 
(to mention the most frequent types). In this chapter, we first deal with the formal 
notation for writing the rules introduced by Chomsky and Halle (1968) (SPE). Next, 
we will see that the assumption that rules are ordered, whereby the output of one rule 
serves as the input to the next, allows for simple phonological generalizations. Third, 
we discuss the ELsEwHERE CONDITION, a convention that allows us to omit the inclusion 
of exceptions in a general rule, and exemplify the descriptive effects of different types 
of rule ordering. Finally, the concept of opacity is discussed from a rule-based and a 
constraint-based perspective. 


6.2 SPE REPRESENTATIONS 


In the representation proposed by SPE, segments are lists of feature specifications. 
Such a list is also referred to as a feature matrix. In the original proposal, all features 
were binary, and there were no dependencies between features. Strictly speaking, 
therefore, (1a) is the SPE feature matrix of an apical [t]. In (1b), we give the represen- 
tation with the univalent place feature [corona], which was introduced in Chapter 5. 
(In this book, representations with binary major place features, as in (1a), will not be 
used.) Note that the status of an expression like ‘(t]’ is that of a shorthand notation for 
the corresponding feature matrix. 

A morpheme is represented as a string of feature matrices. The beginnings and 
ends of morphemes are indicated by boundary symbols. What may, with hindsight, 
be seen as puzzling is that the only boundaries recognized by SPE are morphosyntac- 
tic boundaries. The symbol + was included in representations at the boundaries of 
word-internal morphemes (morpheme boundary), while # served as the word 
boundary. Thus, (2) is the representation of the word pens. In much of the literature, 
word-internal morpheme boundaries are represented by means of a dash, as in 
[pen-z] pens, rather than with +. 


1 


(1) a. +cons b. +cons 
—son —son 
—voice —voice 
—spread —spread 
—constr —constr 
—cont —cont 
—nas —nas 
—lat —strid 
+cor —lat 
十 ant CORONAL 
一 distr 十 ant 
一 round —distr 
—low 
—tense 
(2) tsyl 
—syl —cons —syl —syl 
+cons +son +cons +cons 
—son +voice +son —son 
—voice —spread +voice +voice 
—spread —constr —spread —spread 
—constr 十 cont —constr —constr # 
—cont 一 Das —cont +cont 
—nas —lat 十 nas —nas 
—lat DORSAL —lat —lat 
LAB 一 back COR COR 
—rpimd —low +ant +ant 
+distr —high —distr —distr 
—tense 


Sentences are represented as strings of words. The position in SPE was one in which 


the morphology and syntax preceded the phonology: only when the words are inserted 
into the sentence will the phonological rules be called upon to make the necessary 


adjustments. In (3), a ‘shorthand’ representation is given of the sentence Pens leak. 


(3) #pen+z##litk# 


6.3 SPE RULES 


The general format of an SPE rule is as given in (4). 
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(4) [Minimal 
feature Minimal Minimal 
Specification | Features | specification specification 
of segment(s) that change of lefthand | 一 | of righthand 
undergoing context ; context 
the change 


The information to the left of the arrow is the rocus of the change, that between the 
arrow and the slash is the srrucTURAL CHANGE (SC), while the information to the right of 
the slash is the context. The focus plus the context is known as the STRUCTURAL DESCRIP- 
TION (SD). Rules are assumed to apply within words, and if a rule is to apply across 
word boundaries, the SD needs to include the symbol #. Rules are assumed to apply 
across +-boundaries, even if no + is specified. (If a rule applies only at a 十 -bound- 
ary, this must of course be included in the SC.) Rules written in this format were stan- 
dard in phonological work from the late 1960s until the late 1970s, when the 
representations of Autosegmental Phonology introduced new ways of formulating 
rules (cf. chapters 9 and 10). 

To illustrate, we give (5) as the rule that devoices voiced obstruents at the word-end 
in Dutch. 


(5) FinaL DEVOICING [—son] > [—voice]/__ # 


The rule scans the feature matrices from left to right for the presence of [—son]; 
then, if it finds one, it checks whether there is a # on the right, and if there is, it 
specifies — as the value of the feature [voice] in the matrix with [— son]. In many cases, 
the obstruent will already be | — voice]. The rule is then said to apply vacuously: the SD 
is satisfied, but application of the rule does not bring about a difference. The reason for 
doing it this way, of course, is that the rule can be written with fewer terms than if we 
were to add the feature [+ voice] to the focus of the rule. 

The obstruent undergoing rule (5) frequently appears in word-final position. 
However, sometimes an obstruent needs to be devoiced that is not adjacent to the 
word boundary. For instance, when the 3sg suffix [t] is attached to the verb stem [le:z] 
‘read’, the surface result is not *{le:zt], but [le:st]. In the SPE representations, this 
means that we must express in the rule that other obstruents may intervene between 
the obstruent to be devoiced and the word end. This is done by adding the term ‘Co 
before the #. The C itself is shorthand for [—syll]. (The symbol V is similarly used as 
a shorthand for [+syll, —cons], i.e. a ‘true vowel.) The subscripted number 
nmeans‘n or more (instances of the symbol), just as a superscripted digit means‘n or 
less’. The term ‘C,’ therefore translates as ‘zero or more consonants’, and has the effect 
of allowing rules to ignore consonants in the positions in which it is used. Our rule 
therefore now looks like (6). 


(6) FinaLDevoicinc [—son] > [—voice]/__ Co# 


Version (6) would be able to apply twice in the same word. When shifting through 
the form from left to right, the rule will find that its SD is in fact met in a form like 
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[le:z-t]. Similarly, it is met twice in a case like [ho:vd] ‘head’, whose singular and plural 
forms are {ho:ft] and ['ho:vdon], respectively. The first focus is [v], when [d] corre- 
sponds to Co, and the second is [dj, when Cy corresponds to no consonant. 


| Q48 The Dutch past participle is formed by prefixing [xa] and suffixing [d] to the verb 
stem. The suffix shows up in its underlying form when the inflectional suffix [ə ] is 
added, as in [xa + vyl + d + ə} ‘filled-PARTIC-OBL’, but otherwise shows up as [t]. 
In the case of a verb stem like [sxyd], the uninflected [xə + sxyd + dj may be 
formed, and is pronounced [xasxyt]. Does rule (6) apply to both underlying [d]s in 
this last form? Motivate your answer. 


6.3.1 Reference to the syllable 


Dutch FINALDEVOICING applies to obstruents in the syllable coda, not just to the smaller cat- 
egory of word-final obstruents. This is clear from the way speakers of Dutch treat English 
words like Sidney ['stdni}, business [“biznis}: when they are used as loans, English sylla- 
ble-final [dz] are replaced with [t,s], respectively (Booij 1995). Venneman (1972), 
Hooper (1976) and Kahn (1976) (re)introduced the syllable in phonological representa- 
tion. Reference to the syllable makes it possible to give a formulation of FINAL DEVOICING 
which takes these additional facts into account. This is done in (7), where the o-labelled 
parenthesis is used to indicate the syllable boundary. In Chapter 10 we will deal more 
extensively with the syllable in the structural descriptions of rules. 


(7) FinaLvevoicinc [一 son] 一 [— voice] /__ Cy), 


6.3.2 The brace 


A notational device that has been widely discredited as a theoretical element, but 
which is often used out of convenience, is the brace. The brace notation is used to 
express a disjunction between two or more terms (‘either ... or’), and is thus found in 
rules that are partly identical. Again, when no reference to the syllable is made, the 
syllable-final context can usually be captured by saying ‘before a consonant or the 
word-end’. (Even this characterization frequently fails to capture the right context, as 
explained in Chapters 10 and 15.) Rule (8), for instance, nasalizes vowels before a nasal 
followed by another consonant or the word-end. Rule (8) would change French [bon] 
good-MASC and [bonte] ‘goodness’ into [b5n] and [bSnte], respectively. 


(8) V—>[+nas] /— [+nas] {¢ 


Q49 The symbol ð is used to the left of the arrow in the case of insertion, and to the right of 
the arrow in the case of deletion. What would be the prose version of (1), for instance? 
And of (2)? 


(1) oat 
(2) roae 


SPE rules 8} 


6.3.3 Variable feature values 


To express assimilations, feature values need to be made to agree. For instance, in 
Turkish, high vowels in suffixes agree for [back] and [round] with the preceding vowel 
in the word, as shown in (9). 


(9) Nominative Possessive Gloss 


koj køjy ‘village 
kep kepi ‘cap 

at atw ‘horse 
son sonu ‘end’ 


A rule to achieve this would use variable feature values, expressed with the help of 
Greek letters. Rule (10) says that high vowels agree in backness and roundness with 
the preceding vowel: œ could be — or +, and so could B, independently. 


(10) [ +syil 5 aback I aback C 
+high Bround Bround OF 


6.3.4 Parentheses 


The parenthesis notation is used to include optional elements in rules. Dutch has a 
rule of REGRESSIVE VOICING which applies within words as well as across word bound- 
aries. Within words, it applies to [lif-da] ‘dear-ness, love’ to form [livda], and across 
words it applies in [lif ##di:r] ‘dear animal’ to form [liv di:r]. Since SPE assumes that 
every word that leaves the morphology has #s around it, two #s need to be specified in 
the sp. However, because the rule also applies within words, they need to be put in 


parentheses to indicate that they may, but need not, be present in the representation, 
as shown in (11). 


+voice 
(11) [一 Son] 一 [+voice]/ _ (# #)| 一 cont 
—son 


6.3.5 The transformational rule format 


The focus of the rule format illustrated above is confined to only a single segment. 
However, there are processes that affect more than one segment. For example, METATHE- 
sis is a process that switches round two segments, as can be seen in the Old English 
word for grass, which varied between [gras] and [gers]. To be able to refer to changes 
involving more than one segment, the transformational rule format was used. It lists 
the relevant string of segments and boundaries to the left of the arrow, and repeats that 
string, with the SC; to the right of the arrow. The segments of the context are not liter- 
ally reproduced, but identified with the help of digits. Rule (12) says: Delete a coronal 
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nasal before a consonant or at the word-end, and nasalize the vowel that precedes iť. 
It would change French [bon] and [bonte] into [b5] and [bõte], respectively. 


+cons 
[+syll] | +nas # 一 1 
(12) COR C [+nas] Ø 3 
1 2 3 


k i u 
g e 0 
X ao 
Y 
n 
Ir 
w j 
a- 
[i> onom # 
—low 
Qe +syl 
Ss an ahigh 
ahigh sy Blow 
Blow 
—cont +cont 
十 
(3) 0 一 —voice / on — | —voice # 
COR COR | 
ant ant 


.  Obstruents are voiceless after word-internal or word-external obstruents. 
2. An {a] is inserted between [r,l] and a labial obstruent in the same word. 

3.  Suffix-initial vowels are front if the preceding syllable contains [i], and back if the 
preceding syllable contains [u]. 


6.4 LINEAR ORDER 


Obviously, if in some language there were only one rule, its input would consist of 
the lexical representation of the forms that meet its structural description (the underly- 
ing representation, UR), while its output would correspond to the actual pronunciation 
(the surface representation, SR). Languages generally have more than one rule, and the 
question therefore arises of how phonological rules apply: in sequence or simultane- 
ously? Let us assume, for the sake of argument, that rules apply simultaneously to the 
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underlying representation. Any rule would always scan the underlying representation to 
see if its structural description was met, and if it was, the change would be made. This 
mode of application is known as simultaneous rule ordering. Another possibility 
would be that the output of one rule is taken as the input of another rule. In this situa- 
tion, you would have to know in which order the rules applied. This option is known as 
linear rule ordering. How do we decide between these two possibilities? As always, the 
answer depends on which theory provides the more economical description of the facts. 
Of course, the ordering of two rules is relevant only if the order in which they apply 
makes a difference for the forms that are produced. 


Q53 in many varieties of Dutch, coda clusters consisting of a liquid and a consonant other 
than [t,d] are broken up by [a], causing [kerk] ‘church’ and {kalf] ‘calf’ to be [kerak] and 
[kalaf] on the surface, respectively. They also have a rule deleting [n] after [ə], if these 
segments occur in the same syllable. Demonstrate that the order in which n-DELETION 
and sa-INSERTION apply makes a difference for the SR of a word like [va:rn] ‘fern’. 


Q54 The following are the underlying forms of a number of words in Tonkawa (from 
Phelps 1975). 


netale-o? ‘he licks it’ 
we-netale-o? ‘he licks them’ 
netale-n-o? ‘he is licking it’ 
we-netale-n-o? ‘he is licking them’ 


picena-o? ‘he cuts it’ 
we-picena-o? ‘he cuts them’ 
picena-n-o? ‘he is cutting it’ 
we-picena-n-o? ‘he is cutting them’ 


1. Identify the underlying forms of the morphemes for ‘lick’, ‘cut’, ‘3rd-sg-SUBJ’, ‘3rd-sg- 
OBJ’, ‘3rd-p!-OBJ’, and the ‘PROGRESSIVE’ (cf. English to be [Verb]-ing). (NB: One of these 
does not have a phonological form in Tonkawa). 

2. Tonkawa has the rules CONTRACTION and TRUNCATION: 

(i) contraction V — Ø / # CVC __ CV 

(ii) TRUNCATIONV > Ø / V 

Give derivations (i.e. underlying forms, rule applications, surface forms) of ‘he licks it’, ‘he 
is cutting them’ and ‘he cuts them’. 


There are many pairs of rules that do not interact, like Tonkawa CONTRACTION and TRUN- 
cation in the previous question. Such rules are unordered. In order to show that it is the 
sequential application of phonological rules that provides the simpler description, we 
will consider two rules of English, both of which apply to the plurals of nouns. As you 
may remember, the plural suffix of English has three alternants. After (s,z,J,3,tf,d3], 
the alternant [1z] is used; after voiceless segments other than [s,f,tf], [s] occurs, while 
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elsewhere [z] is used, which we argued in Chapter 4 is also the underlying form of the 
morpheme. The plurals of [bas], [pen] and [buk] (bus, pen, book) are, respectively, 
[basiz], [penz] and [buks]. Recall that we need two rules: one that inserts [1] between 
the stem and the suffix if the stem ends in [s,z,J,3] (I-INSERTION),and one that devoices 
[z] after voiceless segments (DEVOICING). We repeat this derivation in (13). 


(13) bas-z pen-z buk-z 
]-INSERTION basiz (n.a.) (n.a.) 
DEVOICING (n.a.) (n.a.) buks 

basiz penz buks 


Which theory allows us to write the simpler rules, the simultaneous-ordering theory or 
the linear-ordering theory? For the formulation of I-INSERTION it would not make any dif- 
ference which theory we adopted, because under either theory the rule applies to the 
underlying form. This is necessarily so under a simultaneous mode of application, 
and in an ordered mode of application, it would simply be ordered first. This rule is 
given in (14). 


(14) 1-1Nsertion 0 1/[+strident] __ +z 


Now consider the second rule, pevoicinc. In a simultaneous-ordering theory, we would 
have to make sure that it does not devoice the [z] in cases like [bas-z]. That is, the rule 
needs to express ‘devoice [z] after all voiceless segments except [s,J,tf], i.e. after 
[p,t,k,f,0]’. However, [p,t,k,f,8] do not form a natural class: phonological rules do 
not typically refer to this group of sounds. Indeed, in the theory of distinctive features 
it cannot be characterized without the suspect ‘either ... or’ braces, and even then it 
must be assumed that [tf] is really a sequence of [t] and [f]. This clumsy formulation 
is given in (15). It says: ‘after a voiceless obstruent which is either [—strid] (i.e. [6]) 
or [LABIAL] (i.e. [p,f]) or [一 cont] (i.e. [p,t,k])’. Incidentally, [tf] is also included by 
[— cont], of course, which is why we needed to assume that this affricate is a sequence 
of [一 cont] [t] and [+cont] [f] quite against the intuition of native speakers, who 
consider [tf] a single consonant. 


(1 5) —voice 
DEVOICING [ —son] 一 [ —voice] / Sina # 
[LABIAL] = 
[—cont] 


Now consider the other option. If pgvoIcING is allowed to apply to the output of 
I-INSERTION, we could simplify (15) to (16). This is because at the time that pevoicine 


gets to apply to [basiz] (the output of I-INSERTION), the [z] no longer appears after a 
voiceless segment, but after [+ voice] [1]. 


(16) DEVoICING [—son] 一 [一 voice]/[ 一 voice] __ # 


This example is representative of the difference between the two modes of application. 
The assumption that rules apply to the output of other rules allows us to formulate 
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phonological rules as simple generalizations, while an assumption that rules always apply 
to the underlying form forces us to repeat information already stated in other rules. 

The theory of linear rule ordering also assumes that the order of rules is constant. 
That is, if two rules apply in the order A—B to one form, they cannot apply in the order 
B—A to the next. The ordering is also transitive. This means that if there are three 
rules A, B and C, where rules A and B apply in the order A—B and rules B and C apply 
in the order B—C, then rules A and C necessarily apply in the order A—C. 


6.5 EXTRINSIC AND INTRINSIC ORDERING 


Is the order in which rules apply predictable from any properties of the rules con- 
cerned? If it is, no ordering statement would be necessary: the rule order is said to be 
intrinsic. If the order is not given by the theory, and an explicit ordering statement of 
the type ‘Rule X applies before Rule Y’ is necessary, the rule order is extrinsic. The 
issue of intrinsic rule order occupied many phonologists in the 1970s, but the search 
for the principles that exhausively govern the order in which rules apply is generally 
considered to have been unsuccessful (Kenstowicz and Kisseberth 1977). A principle 
that has stood the test of time is the ELSEwHERE CONDITION. This is really a principle gov- 
erning the application of rules in general, and has been invoked in morphology as well 
as phonology. What it says is that when one rule applies to a subset of the forms that 
another rule applies to, the general rule is blocked from applying to that subset. So it 
is not just a principle governing order, but also application as such, in the sense that 
only one of the two rules will be allowed to apply. A morphological example will make 
it clear why this is a useful principle. Take the English morphological rules given in 
(17) and (18). Rule (17) says: “Attach the suffix [z] to noun stems in order to form the 


plural’, and rule (18) says: ‘Attach the suffix [an] to the noun stem ox in order to form 
the plura. 


(17) [E Iy Zou 
(18) ~ [[oks], 2M) un 


In order to prevent the formation of *oxes, we must either stipulate that (18) applies 
before (17), or add to (17) the clause ‘except in the case of {oks]’. The EtsEwHERE 
CONDITION makes either move unnecessary: because (18) applies to a subset of the con- 
texts specified by (17), it automatically blocks (17). It will be clear that this principle 
saves us from having to add all sorts of exception clauses to general rules. 
A Finnish phonological example, from Kiparsky (1973a), is given in (19). 


(19) Underlying Derived 
menek mene ‘go’ 
menek alas mene alas ‘go down’ 
menek pois menep pois ‘go away’ 


menek kotiin menek kotiin ‘go home’ 
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Word-final [k] in Finnish is deleted, unless a consonant follows, in which case the [k] 
assimilates to that consonant, creating a geminate. The k-assimitation rule is given in 
(20). The rule at issue is k-DELETION. Without the ELSEWHERE CONDITION we would have to 
state this rule as (21), which explicitly specifies the context ‘either when followed by a 
vowel or by pause’, i.e.‘except when a C follows’. 


k#C 
(20) k-assIMILATION 123 > 323 


(21) k-pELETION k > @/__ # | | 
pause 


If we were to formulate k-DELETION as (22), the ELSEWHERE CONDITION would ensure that it 
would only apply if (20) did not apply. Rule (20) applies to a proper subset of the 
forms to which (22) is applicable, and therefore applies first, blocking (22). As a result, 
(22) cannot apply to [menek pois]. Also, (22) will not apply to [k] in [menek kotiin], 
which after all was input to rule (20). In this form, (20) assimilated the ‘original’ [k] of 
[menek] to the [k] of [kotiin]. Although it applied vacuously, it did apply, thereby 
blocking (22). 


(22) k-peterion k-—>0/__ # 


Although a number of other principles governing rule ordering have been proposed, 
they have not stood the test of time. (The theory of Lexical Phonology will be seen to 
provide a partial answer to the question of intrinsic ordering, as discussed in Chapter 8.) 

A widely quoted argument for extrinsic ordering is based on a case of dialectal vari- 
ation in Canadian English (Joos 1942). The difference between the two dialects con- 
cerned can be described as resulting from different orderings of the same two rules, 
FLAPPING and PRE-FORTIS CLIPPING (the terms are from Wells 1982). The first rule, given in 
(23), causes a [t,d] to be pronounced as an alveolar tap before reduced syllables, as in 
[Siri] city, ['berr] better. Note that [r] is voiced. The second rule (24) shortens vowels 
and-sonorant consonants preceding voiceless segments. As a result of this rule, the [i:] 
in [bi:d] bead is longer than that in [bi:t] beat, the [en] in [tenz] tens is longer than the 
[en] in [tens] tense, and the [i:] in ['ti:z1n] teasing is longer that in ['li:s1] leasing. (More 
adequate statements of these rules are possible, but not necessary for the point at issue.) 
We will use an ad hoc feature [long]. 


V 
(23) FLAPPiNG [t,d] >r / [—cons] __ pra 


(24) PRE-FORTIS CLIPPING [+voice] 一 [—long] /__ [— voice] 


In some dialects, these rules apply in the order FLAPPING — PRE-FORTIS CLIPPING. That is, the 
words rider (‘someone who rides’) and writer (‘someone who writes’) are homo- 
phones, both being pronounced ['ratrar], while ride and write are [raid] and [rart] 
(where [AI] represents a shortened [at].) This is shown in (25). 
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(25) ‘raid-ar Tart-3r raid rait 
Rule (23) r r (n.a.) (n.a.) 
Rule (24) (n.a.) (n.a.) (n.a.) Al 
Output ‘ralrar ‘ralrar raid ralt 


In most Canadian speech, however, the pronunciations of these same words are 
[ratrar], [ratrar], [rard] and [rart]. This situation is obtained if we reverse the order of 
the rules, as shown in (26). 


(26) ‘raid-ar ‘rait-or raid rait 
Rule (24) (n.a.) Al (n.a.) Al 
Rule (23) c È (n.a.) (n.a.) 
Output ‘ralrar 'TALfor raid rait 


Clearly, if dialects can differ depending on the order in which two rules apply, it can- 
not be the case that rule ordering is predictable. 


6.6 FEEDING, COUNTERFEEDING, BLEEDING, COUNTERBLEEDING 


Early attempts to find universal principles governing the order in which rules apply 
led to a categorization of rule interactions (Kiparsky 1968). The idea was that certain 
rule orderings were more natural or expected than others, and that phonological 
change could in part be explained by assuming that in the course of time rules tend to 
reorder so that they come to have natural orders. While the principles have been aban- 
doned, the terms used to refer to the different types of interaction have acquired wide 
currency, and we therefore give them here. Four types will be distinguished. In every 
case two rules are assumed, which will be referred to as rule A and rule B. 


6.6.1 Feeding order 


If Rule A increases the number of the forms to which rule B can apply, the order A—B 
is a feeding order. The British English data in (27) illustrate PREGLOTTALIZATION, given in 
(28), which rule glottalizes voiceless plosives in the coda. 


(27) Underlying Derived 
lukt lu‘kt looked 
kets ke'ts cats 
hints hin'ts hints 
kempgraund kem'pgraund campground 


(28) PREGLOTTALIZATION 


—cont : 
Eai 一 [+constr] / [+voice] __ (# #)C 


Now consider the data in (29). In the first and second columns, we see that between a 
nasal and a voiceless fricative in the same syllable a voiceless plosive is inserted, whose 
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place of articulation is that of the preceding nasal. As shown in the third column, this 
rule of FORTIS PLOSIVE INSERTION (30) applies before (28), and thus causes words like 
prince, which underlyingly end in a nasal consonant followed by a fricative, to be input 
to rule (28). That is, (30) feeds (28). 


(29) Underlying Fortis stop Preglottalization 


prins prints prints prince 
len@ lenk len?kð length 
woimb wo:mp0 woim?p0 warmth 
(30) FORTIS STOP INSERTION 
—cont C CE 
Ø 一 | 一 voice | /| ...| +nas | 一 | +cont |... |o 
QLPLACE QPLACE —voice 


6.6.2 Counterfeeding order 


If Rule A increases the number of the forms to which rule B can apply, the order B— A 
is a counterfeeding order. The French feminine adjectival suffix is [a]. The language 
has a rule that deletes word-final [ə], which is given as 9 -DELETION in (31). The lan- 
guage also has FINAL CONSONANT DELETION (32), which deletes certain word-final conso- 
nants in contexts other than before a vowel or glide, causing [patit] to be pronounced 
[pəti] in the context concerned. FINAL CONSONANT DELETION applies before 3-DELETION. If the 
rules applied in the opposite (feeding) order, 9-DELETION would cause the feminine alter- 
nant of ‘little’ to be homophonous with the masculine form. In other words, the rules 


apply in counterfeeding order, as illustrated in (33) for the forms for ‘little nephew and 
‘little niece’. 


(31) 3-DELETION 9 > O/__ # 


(32) FINAL CONSONANT DELETION C—>@/__ ##[ + cons] 


(33) patit nave patit-a njes 
Rule (32) 0 (n.a.) 
Rule (31) (n.a.) 0 
pati nove patit njes 


6.6.3 Bleeding order 


If Rule A decreases the number of the forms to which rule B can apply, the order 
A—B is a bleeding order. This type of interaction, in which Rule A prevents Rule B 
from applying to particular forms, occurs in the derivation of the English plural. 
Recall that I-INSERTION (14) prevents DEVOICING (16) from applying to form like [bas-z] 
by separating the final [z] from the stem-final obstruent. In (34), repeated from (13), 
this interaction is shown. That is,I-INSERTION bleeds DEVOICING。. 
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(34) bek-z kis-z al-Z 
I-INSERTION (n.a.) I (n.a.) 
DEVOICING s (n.a.) (n.a.) 
bæks kisiz aiz 


Thus, when two rules A and B are in a counterfeeding order, the application of rule A 
does not increase the number of forms to which rule B can apply. When they are in a 
bleeding order, rule A actually decreases the number of forms to which rule B can apply. 


6.6.4 Counterbleeding order 


If Rule A decreases the number of the forms to which rule B can apply, the order B-A is 
a counterbleeding order. If two rules can apply to the underlying form, this order enables 
both rules actually to do so. In the Kaatsheuvel dialect of Dutch, [a] is inserted between 
the noun stem and the diminutive suffix, if the stem ends in a lax vowel followed by a 
sonorant consonant. Thus we find it in (35a), but not in (35b), which stem ends in an 
obstruent, or in (35c), which stem has a tense vowel. The rule of 3-INSERTION is given 
in (36). Here, X is a free variable, and represents whatever shape the diminutive suffix has. 


(35) a. snor-ka snoraka ‘moustache’ 
hal-kə haləkə ‘hal? 
kam-ko kaməkə ‘comb’ 

b. lap-kə lapkə ‘cloth’ 
mys-ka myska ‘sparrow’ 
c. raim-kə raimka ‘window’ 


+cons 
(36) 3-INsERTION Ø > 2/[—tense] | Tsoi J+ Xl 


Kaatsheuvel also has a rule which inserts [s] between the dimunitive suffix [kə] and a 
stem-final dorsal consonant. Thus, $-INSERTION (37) breaks up a sequence of two dorsal 


consonants, as in [bakska], from [bak- ka] ‘tray’, [ma:xske], from [ma:x-ka] ‘stom- 
ach. 


ko] 


DIM 


(37) Ss-INsERTION 0 一 S/ peed 
DORSAL 


If we want to know how these two rules interact, we need to consider the diminutive 
form of a word like [slap] ‘snake’, which satisfies both rules. Let us suppose — 
contrary to fact - that it is *[slanako]. In order to obtain this form, we would have to 
apply 3-INSERTION first, so as to destroy the context of the two adjacent dorsal conso- 
nants. This would be a bleeding order. The correct form, however, is [slanaska]. That 
is, we need to apply the rules in a counterbleeding order: first (37), then (36). 


(38) bak-ko slan-ka snor-ka 
Rule (37) s s (n.a.) 
Rule (36) (n.a.) 3 9 


bakska slanasko snoraka 
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Q55 Mwera has three rules, given as (1), (2) and (3) below. Two noun stems in Mwera are 
[gomo] ‘lip’ and [kuja] ‘cape bean’. The plural is formed by prefixing a nasal consonant, 
whose underlying form is [n]. The plural surface forms are [nomo] and [nguja]. The 
following three rules derive the surface forms (Kenstowicz and Kisseberth 1977: 157): 


—son 
—cont | > @/{+nas]__ 
+voice 


> | — [+voice] / Bes | Ra 
-6 


ont +nas 


TSY | — [apace features] / __ 0 
as QPLACE features 


Suggest suitable names for these rules. 
Please show that of the six possible orders that these rules could have, only one is correct. 


Q56 Dutch has a number of rules affecting the feature [+ voice] in obstruents. In order to 
derive the surface forms in the second column from the underlying forms in the first, 
four rules are required: DEGEMINATION, FINAL DEVOICING, PROGRESSIVE DEVOICING and REGRESSIVE 
voicinG. The first two can be formalized as follows: 


DEGEMINATION oe (#) [+cons] 


2 3 > 0923 
Condition: 1 = 3 
FINAL DEVOICING [— son] 一 [—voice]/__ Co # 
Underlying Derived Gloss 

a. la:t#blujar la:dblujar ‘late developer’ 
b. bad#bruk badbruk ‘bathing trunks’ 
c. kaud#vy:r kautfy:r ‘gangrene’ 
d. vand#te:xal vanterxal ‘wall tile’ 
e. le:z#faut le:sfaut ‘reading error’ 
f. lup#zoeyvar lupsceyvar ‘very pure’ 
g- bo:t#toxt boztoxt ‘boat trip’ 
h. ka:z#za:k ka:sa:k ‘cheese shop’ 
i. kop#bal kobal ‘header (football)’ 
j. xaud#delvar XAudelvar ‘prospector’ 
k. xaud#ko:rts XAutko:rts ‘gold fever 
|. krab#sxa:v krapsxa:f ‘scraper’ 
m. le:z#bril le:zbril ‘reading glasses’ 
n. heis#balk heizbalk ‘hoisting beam’ 
1. Determine what the other two rules should do, and how the four rules should be ordered. 
2. Give formal notations of PROGRESSIVE DEVOICING and REGRESSIVE VOICING. 
3. Give the derivations of items d, f, h, i, m. 
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Q57 The river Linge [lina] flows not far from Kaatsheuvel. What would you expect the 
diminutive form to be? 


Q58 In (26), the order in which FLAPPING and PREFORTIS CLIPPING are applied to rider and writer 
in mainstream Canadian English results in different surface forms for these two 
words. What type of rule order is this? 


6.7 OPACITY 


The most striking advantage of allowing rules to apply to the output of other rules is 
that they can be formulated in more general ways. Importantly, they express linguistic 
generalizations, even though they are not true ‘on the surface’. This goes for both 
counterfeeding and counterbleeding rule orders. In the face of the final [t] in the fem- 
inine form [patit] in [patit njes] ‘little niece’, French FINAL CONSONANT DELETION makes no 
good sense from a surface perspective. Similarly, the counterbleeding application of 
S-INSERTION before 9-INsERTION, discussed above for Kaatsheuvel Dutch, produces the 
surface form [slanaska] from an underlying /slan-ko/. However, in this surface form 
there is no apparent reason for the presence of the inserted [s], since it does not appear 
between dorsal consonants. The generalization embodied by S-INSERTION,'[S] occurs so as 
to break up a sequence of two dorsal consonants’ is, again, not surface true. That is, the 
presence of [s] makes the form opaque. 

As observed in section 4.3, the fact that phonological generalizations are not 
‘surface-true’ presents a problem in OT, if only output-based constraints are 
allowed. In order to illustrate this, we need two constraints in addition to the faithful- 
ness constraints introduced in Chapter 3, which we will refer to as S-INSERTION and 


9-INSERTION. Tableau (39) shows the fate of a number of output candidates for underly- 
ing /slan-ko/. 


(39) 
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Max-IO is violated in the last output form [slaka], since the input [0] is not repro- 
duced. The constraints Dep-IO(v) and Dep-IO(c) will be violated if, respectively, a 
vowel or a consonant is inserted that was not present in the input. Finally, S-INSERTION is 
violated if a sequence of two dorsal consonants is not broken up, while 9-INsERTION is vio- 
lated in diminutives if [9] fails to appear after a stem ending in a lax vowel and 
a sonorant consonant. As tableau (39) shows, there is no way in which the sad- 
faced losing form [slanoske] can beat the winning form [slanaka]. They both 
violate Dep-IO(v), but in addition, [slanaska] incurs a violation of Dep-IO(c). With 
[slanoko] around, this means that no possible ranking can ensure that [slanaska] is 
characterized as the optimal output form. 

A number of proposals have been put forward to deal with opacity in OT. We will 
briefly mention two. The first is to allow for intermediate representations between the 
input, the UR, and the output, the SR. Instead of just two levels of representation and 
one grammar connecting them, there would be three, allowing two grammars to be 
formulated, or four, giving three grammars (Kiparsky 1998, Rubach 2000). The exist- 
ence of more than two levels in phonology will form the topic of Chapter 8. For now, 
we could assume one extra level, making it possible to first produce an intermediate 
winning candidate /slansko/, which form we then take as input to the next grammar 


so as to produce a winning [slanaska] at the end. This is illustrated in tableaux (40) 
and (41). 


(40) First level 


ss’ [slanaska] 
[slaka] 
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While a level-ordered grammar presents a solution to many problems, in particular 
those discussed in treatments of Lexical Phonology, in this particular case, the inter- 
mediate level is not well motivated. To the native speaker, the form [slanska] is just 
wrong and does not appear to represent a reality in any sense. In Chapter 8, we will 
motivate an intermediate level, the ‘lexical representation’, that does have psychological 
reality. 

A second way in which OT can deal with opacity is to allow candidates to be evalu- 
ated not just with respect to the input form, but also with respect to other output can- 
didates. We could judiciously take [slanska] as the output form with respect to which 
other forms are evaluated. Suppose that a constraint is added, Max-!$!95*1Q, which 
demands output candidates to be maximally faithful to the output candidate 
[slanska], as shown in (42). 


The output form [slanaska] is now characterized as the optimal output form, because 
compared with [slagaka], which like [slanaska] has a [a] too many but unlike 
[slanaska] has an [s] too few, it is more faithful to Max-!S!55**10, This way of dealing 
with opacity is known as sympathy theory (McCarthy 1999), where the output can- 
didate to which other forms are evaluated, here [slanska], is called the ‘sympathetic 
candidate’ and the constraint concerned, here Max-'S!"§°10, the sympathy constraint. 
This approach is not without its problems either. For one thing, the choice of the ‘sym- 
pathetic candidate’ seems arbitrary. Moreover, cases of opacity have been presented 
which sympathy theory cannot account for (McCarthy 2002). 


6.8 CONCLUSION 


The notational conventions of SPE rules, such as parentheses, variable feature values 
and braces, were originally introduced as the essential elements in a phonological 
grammar whose significance lay in the simplifying (‘ink-saving’) effect they had on 
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the formulation of those rules. Over the years, these notational conventions have lost 
this strict theoretical status as a result of the breakthroughs in our conception of 
phonological representations made in the late 1970s and early 1980s. From Chapter 9 
onwards we will introduce these new phonological representations, which are collect- 
ively known as ‘nonlinear’ representations. 

Phonological rules can be ordered pairwise in four ways, some of which increase 
and some of which decrease the number of forms to which the second rule can apply. 
Despite an intensive research effort made in the early 1970s, rule ordering has 
remained an unpredictable aspect of derivational grammars. Phonological general- 
izations that are not ‘surface-true’, which are dealt with by counterfeeding and coun- 
terbleeding rule orders, have proved reluctant to yield to OT analyses. We will return to 
OT in Chapter 14. 

The next chapter provides an illustration of the SPE derivational framework in the 
form of a study of the distribution of the morpheme alternants of the diminutive suf- 
fix in two varieties of Dutch. After that, Chapter 8 is concerned with an issue which is 
relatively independent of the issue of representation and the issue of how to describe 
phonological adjustments. It concerns the question whether two levels of representa- 
tion, one for underlying representations and one for surface representations, suffice to 
explain what we know about the pronunciation of languages. 


7 


A case study: the diminutive suffix in Dutch 


7.1__ INTRODUCTION 


In this chapter, we will apply the SPE theory outlined in Chapter 6 to the alternations 
in the diminutive suffix in two varieties of Dutch. First, we describe the diminutive 
suffix in Standard Dutch, historically a western variety. A number of ordered rules 
will appear to be necessary to derive the surface alternants of the diminutive suffix. 
Next, we describe diminutive formation in Sittard Dutch, a south-eastern variety. 
This dialect has a different underlying form for the suffix, and has a number of 
different rules needed to derive the surface alternants. Moreover, some of the rules 
that they have in common will have to be ordered differently to obtain the correct 
results. ! 


7.2 _ GENERAL DUTCH RULES 


There are a number of phonological rules that all varieties of Dutch share. For the pur- 
poses of diminutive formation, there are three such variety-neutral rules: FINAL DEVOICING, 
PALATALIZATION and DEGEMINATION. 


7.2.1 Final devoicing 


Recall that in Dutch, no voiced obstruents can appear in the coda, a constraint which 
we referred to earlier as FINALDEVOICING (see Chapter 6). For instance, underlying [bord] 
‘plate’ only appears with its [d] in the plural ['bərdən], its singular form being [bərt]. 
Because any underlying obstruents in the words to be discussed in this chapter 
appear in coda position in the diminutive form, we will represent all coda obstruents 
as voiceless in these data. 


'The description for Standard Dutch is based on Gussenhoven (1978). For treatments of diminutive formation 
in (standard and nonstandard) Dutch, see van den Berg (1975), Trommelen (1983), Booij (1984), van der Hulst 
(1984), Hamans (1985), Nijen Twilhaar (1990), Wetzels (1990), Haan (1996) and van de Weijer (2002). 
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7.2.2  Palatalization 


The articulatory contact for the coronals [t,s,n] is between the tip and/or blade of the 
tongue and the alveolar ridge, the sides making a complete closure with the gums. 
When these consonants appear before [j], the contact is between the forward part of 
the front of the tongue, i.e. the zone behind the blade, and the palatal area immediately 
behind the alveolar ridge. Thus, instead of alveolar, these coronal consonants are 
prepalatal in this context, and PALATALIZATION (1) thus assimilates [t,s,n] to their 
[一 anterior] counterparts [c,J,n]. For instance, before the pronoun [ja] ‘you’, the ver- 
bal forms [e:t] ‘eat’, [las] ‘weld’, [kan] ‘can’ are pronounced [etc], [laf], [kan]. The [j] 
itself tends to be absorbed by the preceding consonant in what is sometimes referred 
tO aS a COALESCENT ASSIMILATION process. Thus, ‘eat you’ (‘are you eating?’) is [e:cja] or 
[e:ca]. The coronal appproximant [I], which is articulated with the tip touching the 
alveolar ridge, does not undergo the rule. PALATALIZATION may affect more than one con- 
sonant. For instance, in [lint] Yibbon both [n] and [t] will become prepalatal before 
[j], as in [linca]. In Chapter 11 we will see that this can be achieved in a single opera- 
tion in a ‘nonlinear’ representation of segments, in which the place features for [n] and 
[t] would be specified only once for both segments. In the ‘linear’ SPE representation, 
each segment is separately specified for the place features, and we therefore need to 
include the term Co (i.e. ‘ignoring any following consonant(s)’) in the rule to make 
sure the focus can be [n] if [t] follows (as well as [t,n,s] when # # follows). As will be 
clear, the rule can apply across word boundaries. (The use of a transformational for- 
mat would also be possible.) 


(1) PALATALIZATION ae 一 [一 ant] /__ Co (# #) [—ant] 


In Sittard Dutch and the dialects further to the south, the (pre-)palatal segments that 
result from the rule also appear in underlying representations, as in [lenc] ribbor and 
[man] ‘basket’ (Schelberg 1979). In such cases, there is no reason to assume that a [j] 
follows these segments, either in the underlying representation or on the surface. 
Because the segments derived by PALATALIZATION are not distinct from the underlying 
segments, we will assume that [j] deletes after the application of PALATALIZATION in Sittard 
Dutch, and we will make the same assumption for Standard Dutch. 


7.2.3 Degemination 


DEGEMINATION forbids the occurrence of a sequence of identical [十 cons] segments 
(cf.Q34 and Q56). It applies, for instance, to the past tense of the verb [he:t] ‘be called’, 
which consists of the verb stem and the suffix [ta]. The result is [he:ta], with one [t], 
which form is homophonous with the inflected form of the adjective [he:t], which 
consists of the stem and the suffix [9]. We give the rule in (2), in a formulation that will 
allow it to apply both within and across word boundaries. 


(2) vecemination [十 cons] (# #) [+cons] _, g 23 (Condition: 1 = 3) 
1 2 3 
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7.3_STANDARD DUTCH 


fv VIA eee 


In Standard Dutch, the diminutive suffix has five alternants: bisyllabic [aca], illustrated 
in (3a), and the four monosyllables [ca] (3b), [kja] (3c), [pja] (3d) and [ ja] (3e). 


(3) Stem Diminutive 
a. bal balaca ‘ball’ 
b- fi gice ‘egg 
c. komen  koməpkjə ‘king 
d. raim ra:mpjə ‘frame 
e rok rokj3 ‘skirt’ 


A reasonable strategy is to take the alternant appearing after vowels to be the under- 
lying form (cf. (3b)). Before PALATALIZATION, this alternant [co] is [tja]. Next, in order to 
obtain the bisyllabic form of (3a), we need to insert [a] after some nouns. Third, in 
order to get [ja], we need to delete [t]. Fourth, [pja] and [kja] can be produced by a 
rule that changes the place of articulation of [t] to that of a preceding consonant. We 
will call these rules 9-INSERTION, t-DELETION and PLACE ASSIMILATION, respectively. The data 
in (4a) illustrate 9-INSERTION. 


(4) a. slan-tja slanatja ‘snake 
bom-tja bomatja ‘bomb’ 
pan-tja panatj3 ‘pot’ 
bal-tja balotja ‘ball 
snor-tja snorat ja ‘moustache’ 

b. tok rokja ‘skirt’ 
krary kraryja ‘collar’ 

c. ram raimpja ‘frame 
oir oirtja ‘ear’ 


ko:non koinankja ‘king’ 


All of the stems in (4a) end in a sonorant consonant, in contrast to those in (4b). The inser- 
tion of schwa thus appears to be triggered by the value of [+ son] of the stem-final conso- 
nant. However, no schwa is inserted after the stems in (4c), which also end in a sonorant 
consonant. A further requirement, therefore, is that the vowel in the last syllable should be 
[一 tense] (cf. also rule (36) on p. 89). The (nonmarginal) monophthongs of Standard 
Dutch are given in (5). The five lax vowels are given on the left of each cell, and the seven 
tense monophthongs on the right. Dutch also has the diphthongs [ei,cey au], which end in 
[+tense] second elements [i,y,u] and thus are correctly exempted from the rule. 


aoe! [十 round] | [—round] | [+round] 

Re ra al 

| (high-low) | se | œ [| 
[high +ow) | | we | 
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If we assume that the vowel [9] is neither tense not lax, and the feature [tense] is 
therefore irrelevant for this segment, we can formulate the rule as in (6). In Chapter 14, 
we will see that there is the additional condition that the last syllable should be the 
head of a foot, a fact which we will ignore here. 


+cons 
(6) 3-INsERTION 0 — 3/ [ — tense] ted fe sh ON 


We must specify in the rule that [tja], indicated as X, represents the diminutive suffix, 
because the rule does not apply to similar segment strings arising in different mor- 
phological contexts. Thus, {lam-ta] ‘lame-ness’, or the word sequence [kam-t ja] ‘comb- 
3sG your’, are not subject to the rule. 

We move on to the alternant [ja], which appears in the data in (7). The forms in 
(7b) have undergone PALATALIZATION of [s] before [j], after [t] was deleted. 


(7) a. lap-tja lapj3 ‘cloth 
dreeyf-tja dreeyfja ‘grape’ 
krary-tja kraryja ‘collar 


rok-tja rəkjə ‘skirt 
b. _ fars-tja fafa ‘vase’ 
pas-tja pafa ‘step’ 


The observation to be made here is that [ja] appears after obstruents. What we need, 
then, is t-DELETION, as given in (8). Again, since [t]s are not deleted in comparable 


phonological contexts arising elsewhere, reference to the suffix is included in the 
rule’s SD. 


(8) t-petetion t > 0/[—son] + __ ja] 


DIM 


A complication arises in the case of stems that end in [t]. In the data in (9a,b) the 
stem-final [t] is itself preceded by an obstruent. As shown, both the [t] of the stem 
and the [t] of the suffix are deleted in such cases. (In (9), we see the effect of PALATALI- 
ZATION in the word for ‘fist’ on [sj], which sequence arises after the deletion of both [t]s, 
as well as on [nt] in the word for ‘ribbor’.) 


(9) a. bəyt-tjə boxjo ‘bend’ 


kaft-tja kafja ‘book cover’ 
b.  foeyst-tja fæyfə ‘fist’ 
c.  lint-tja linca ‘ribbor 


In order to get rid of stem-final [t] before [tja], we invoke DEGEMINATION (2). The deri- 
vation in (10) shows how (2) and (8) delete both [t]s in [boyt + tj3]. In forms like 


[lipnos] (9c), in which a stem-final [t] appears after a sonorant, DEGEMINATION bleeds 
t-DELETION, also as shown in (10). 


(10) boyt-tja rok-tja Imt-tjo 
DEGEMINATION (2) box tja (n.a.) lind tj3 
t-DELETION (8) boy Oja rok Oja (n.a.) 
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Last, we need to account for the assimilation of [t] to [p,k]. Some nouns undergoing 
PLACE ASSIMILATION are given in (11). Evidently, [t] assimilates to the place of articulation 
of the stem-final nasal. (Since tense vowels are not allowed before 上] in Dutch words, 
we can only show the assimilation of [t] to [k] after an unstressed syllable.) If we for- 
mulate the rule as in (12), where QpPLACE stands for all the features specifying the place 
of articulation, it will apply vacuously to [n-tja], as in [sxun-tja]. 


(11) a. ra:m-tja raimpja ‘window’ 
be:zam-tj3 be:zəmpjə ‘broom 
kornan-tja komnankja ‘king’ 


12 +nas : 
(12) PLACE ASSIMILATION t —> [QPpLAcE] / N — Jol im 


Representative derivations are given in (13). (We will use a dash for ‘nonapplication’ 
from now on.) 


(13) ei-tjə raim-tja bəm-tjə lint-tja bəyt-tjə lap-tjə 
3-INSERTION (6) = - bom-atja - - - 
DEGEMINATION (2) 一 一 一 lingtja bəxð-tjə - 
t-DELETION (8) - - 一 一 bax-0j3 lapja 
PLACE ASS (12) 一 raim-pjə - vac. 一 - 
PALATALIZATION (1) ica. = bomaca linca z = 
Output gica raimpj3 bəməcə linca boyja lapja 


What is the diminutive form of [flam]? Give the derivation of this form. Demonstrate that 
Ə-INSERTION Needs to apply before PLACE ASSIMILATION. 


What is the diminutive form of [fe:st] ‘party’? Give the derivation of this form. Demonstrate 
that DEGEMINATION Should apply before t-DELETION. 

The diminutive of [hemt] (hemd ‘shirt’) is [hempja]. Give the derivation of this form. Show 
that PLACE ASSIMILATION is Crucially ordered after DEGEMINATION. 


7.4 THE DIALECT OF SITTARD 


The phonology of diminutive formation in the dialect of Sittard differs in a number of 
respects from the same morphological process in the standard language. Table 7.1 
gives a comparative survey of 33 etymologically related words. First, as can be seen in 
[eikə] (item 1), the basic form of the suffix would appear to be [-ka] rather than [-ca]. 
The alternant [ca] does occur, though, as in [tencə] (item 13). Second, Sittard 
has no 9-INSERTION. All the Sittard forms that correspond to diminutive forms that have 
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Table 7.1 
Noun stems and diminutive forms in Standard Dutch and the dialect of Sittard 


Gloss Standard Dutch Sittard 

1. ‘egg’ gi ticə gi tikə 

2. ‘rag’ lap lapja lap lepka 

3. ‘grape’  dræyf dræyfjə drust dry:fkə 

4. ‘collar’ kraX kra:xja kra:x kre:cska 

5. ‘skirt’ nk Dkja nk roekska 

6. ‘king’ ko:nan ko:nankja ko:nan ko:nanska 

7， ‘snake’ slan slanaca flan flenska 

8. ‘needle’ nailt na:lca mh næ:Ákə 

9. ‘chair stul stulca foul ftceylka 
10. ‘adventure’ aivən'ty:r a:van'ty:rca a:van'ty:r aivan'ty:rkə 
11. ‘cucumber’ kmkomar kom'komarca kum koumar kumkymarka 
12. ‘evening’ arvant arvanca d:vant cervanica 
13. ‘aunt’ tanta tantace tant tenca 
14. ‘side’ kant kanca kanc kenca 
15. ‘ribbon’ lint inca lenc lenca 
16. ‘basket’ mant manca map menca 
17. ‘flame’ flam flamaca vlam vlemkə 
18. ‘bomb’ bəm bomaca bum bymka 
19. ‘pot’ pan panaca pan penka 
20. ‘thumb’ doeym deeympja du:m dy:mka 
21. ‘broom’ be:zam be:zampje besam besamka 
22. ‘frame’ raim ra:mpjə ram remka 
23. ‘ball’ bal balaca bal belka 
24. ‘moustache’ snor snoreca for Jnoerka 
25. ‘ear’ or orca or g:rka 
26. ‘shoe’ sxun sxunca fo:n Jø:nkə 
27. ‘vase’ fais fa vais verska 
28. ‘step’ pas pafa pas peska 
29. ‘cover’ kaft kafja kaf kefka 
30. ‘fist’ foeyst feeyfa vuis vy:ska 
31. ‘curve’ bəxt boyja box bæçskə 
32. ‘foot fut fuca vout veeyca 
33. ‘plate’ bort borca bort beerca 


inserted schwas in the standard language appear without schwa in Sittard (cf. items 7, 
17, 18, 19, 23, 24). Third, Sittard has UMLAUT, a rule which fronts stressed vowels pre- 
ceding the suffix in the stem (for instance items 2-5). Fourth, an alternant [-sko] 
appears in, for instance, [ flenska] (item 7). 
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Taking our cue from the alternants appearing after vowels, we will assume that [ka] 
is the underlying form. First, we will take care of uMLAUT. The requirement appears to 
be that the last stressed (nonschwa) vowel of the stem should be front. The monoph- 
thongs of Sittard Dutch are given in (14). Whenever the last full vowel of the stem 
is [+back], the diminutive has the [一 back] counterpart, as can be seen in 
items 2, 3, 4, 5, 8, 9, 11, 25. Because in [kum'kumar] (item 11) the umlauted vowel is 
followed by an unstressed syllable in the stem, provision has been made for an 
optional schwa-containing syllable in the formulation in (15). 


[— back] [— back] [+back] [+back] 
[—round] | [+round] | [—round] | [+round] 
| [thigh iow] | nii | vys | | omm | 


(high, ow forage fh ata 
ik Ho ce [ee [3a 


(15) Srrtarp UMLAUT V 一 [— back] /__ (Cy a) Co + ka] pim 


(14) 


The data for [kra:x], [rok] and [flan] (items 4, 5,7) suggest that there is a rule insert- 
ing [s] between the stem and the suffix, if the stem ends in a dorsal consonant, a rule 
we already encountered in Chapter 6. s-INsERTION is given in (16). 


—syll 
(16) SITTARD s-INsERTION 6 > s/ Ed Fa Djma 


Next, forms like [ce:vanca, lenca, menca] (items 12, 15, 16) suggest that there is a rule 
changing the [k] of the suffix into [c] after stem-final [t,c,n], which we will refer to as 
k-FRONTING, given in (17). The feature that distinguishes these coronal consonants from 
[s] is [~ cont], and the requirement is therefore that the last consonant of the stem 
should be a noncontinuant coronal. The dorsal [k] then acquires the features [cor] and 


[—ant], causing it to be [c]. Here, the assumption is that when [k] acquires [corona], 
it loses [DoRSAL]. 


7 J COR COR 
(17) SITTARD k-FRONTING k —> Bat? he Hi 8) au 


Q60 Sittard has two coronal laterals, [I] and [A], occurring in the words for ‘chair’ and 
‘needle’. Would you class these segments as [+ cont] or [一 contj? Hint: Consider the 
behaviour of k-FRONTING. 


?The lax [£,£:,œ,œ:,3,3:] have been classified as [+low]; if they are analysed as [一 low]，uMLAUT will 
have to change vowels to [—back, —low] instead of just to [—back]. Some speakers contrast a [—low] and 


a [+low] front unrounded vowel before [I], in which case the [+low] vowel sounds like [£] as occurring in 
other contexts. 
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How are the three rules discussed so far ordered? First, UMLAUT does not interfere 
with the other two rules. It is triggered by the suffix, and its result, the front vowel, can- 
not make a difference to the structural description of any of the other rules. We will 
apply it as the first rule in our derivation. At first sight, rules (16) and (17) would also 
appear to be unordered, since they apply in mutually exclusive contexts: (16) applies if 
the stem ends in a velar consonant, and (17) applies if the stem ends in a coronal con- 
sonant. They are, however, indirectly ordered, because both rules are ordered with 
respect to an assimilation rule we have not considered so far. Consider items 19 and 29, 
repeated in (18), whose stems end in [n]. Importantly, [n] does not trigger k-FRONTING. 


(18) pan penka *penca 
Jon fornka *fornca 


The assimilation rule is needed to take [n] to [9] before [k], creating [penka], from [pen- 
ka]. This rule crucially intervenes between (16) and (17). On the one hand, we do not 
want [pen-ka] to undergo k-FRONTING, as shown in (18), but on the other hand, we do not 
wish to subject [pen-ka] to s-insertion (*[penska]). In other words, we first need to apply 
§-INSERTION, then change the [n] of the stem into [n], and then apply k-rrontinc. This 
N-ASSIMILATION, then, counterfeeds s-INsERTION, and bleeds k-rrontinc. The derivation of 
the diminutive form of ‘pot is given in (19), together with those of ‘snake’ and ‘basket’. 


(19) pan-kə flan-ko map-ka 
UMLAUT (15) pen-ka flen-ko men-ko 
S-INSERTION (16) 一 flen-ske - 
N-ASSIMILATION pen-ke - 一 
k-FRONTING (17) 一 - mep-ca 
Output penka Jlenska menca 


As explained above, Sittard shares PALATALIZATION and DEGEMINATION (2) with Standard 
Dutch. However, DEGEMINATION is ordered later than it is in Standard Dutch. First, it must 
be ordered after S-INSERTION (16), in order to prevent DEGEMINATION from removing the 
final [k] of [rok], which would produce the incorrect *[rækəa] (item 5). Second, DEGEM- 
INATION Must come after k-FRONTING (17), since the output of K-FRoNTING feeds DEGEMINA- 
TION in a case like [lenc-kə] ([lepc-kə] 一 [lenc-cə] — [lenca] (item 15). And 
finally, PALATALIZATION must be allowed to feed DEGEMINATION in a form like [tenc-ca], 
which is produced after k-FroNTING (item 13). This ordering is illustrated in (20). 
Observe that Sittard treats [c] like any other consonant: [cc] is reduced to [c], just as 
[tt] would be reduced to [t]. That is, [c] is not a cluster [cj] in Sittard. 


(20) rək-kə lenc-kə tant-kə 
UMLAUT (15) ræk-kə vac. tent-kə 
S-INSERTION (16) ræœk-skə - - 
N-ASSIMILATION = - - 
k-FRONTING (17) - lenc-ca tent-ca 
PALATALIZATION (1) - vac. tenc-ca 
DEGEMINATION (2) - len-ca ten-ca 


Output reekska lenca tenca 
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Q61 In the dialect of Wehl, a-insertion applies in a somewhat different set of contexts than 


in the standard language, as shown in the following data (Haan 1996). What is the 


difference? 
Standard Wehl 
a. snor snorace snor snyrece ‘moustache’ 
b. pan panəcə pan penəkə ‘pot’ 
Gi kam kaməcə kam kemka ‘comb’ 
d. slan slanace slan slenska ‘snake’ 
e. stal stalace stal stelaka ‘stable (noun)’ 


Q62 The underlying form of the diminutive suffix in the dialect of Utrecht is [ti]. 
Representative noun stems and diminutive forms are listed below (van den Berg 1975). 


a. ti ziyi ‘egg 

b. lap lapi ‘rag’ 

C: droeyf droeyfi ‘grape’ 
d. kra:X kra:xi ‘collar’ 
È: rok nki ‘skirt’ 

f. ko:nan ko:nanki ‘king’ 
g. slan slanayi ‘snake’ 
h. na:lt na:ltsi ‘needle’ 
i. stul stultsi ‘chair’ 
j. mant mantsi ‘basket’ 
k. bom bomayi ‘bomb’ 
I. pan panayi ‘pot’ 

m. be:zem be:zampi ‘broom’ 
n. ra:m ra:mpi ‘frame’ 
0. bal balayi ‘ball’ 

p. snor snoraxi ‘moustache’ 
q. Or o:rtsi ‘ear’ 

r. sxun SXuntsi ‘shoe’ 
Sy pas pasi ‘step’ 

t. kaft kafi ‘cover’ 
u. boxt boxi ‘bend’ 
v. fut futsi ‘foot’ 

w. bort bortsi ‘plate’ 


1. List the alternants of the diminutive suffix. 

2. Of the rules in the derivation in (13), identify the ones that exist in the dialect of Utrecht 
(with replacement of {(t)ja] by [(t)i] where appropriate). 

3. Two additional rules are needed to account for the alternations in the diminutive suffix. 
Determine the ordering of each of these two rules relative to the rules Utrecht shares with 
the standard language. Formulate these rules, using symbols for segments rather than 
distinctive features. (Hint: their formulation can be very simple if they are ordered right.) 

4. Show the derivation of items a, e, k, n, u, v and w, using the presentational format of (13). 
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Q63 A possible diminutive form like [benca] could be derived from six possible stems. 
What are they? Why isn’t [ben] among them? 


Q64 in many respects, the Brabant variety of Dutch as spoken in Alphen is similar to 
Sittard Dutch. First, the phonological form of the diminutive suffix after vowels is 
[ka], as can be seen in the word for ‘egg’, [aika]. Moreover, s-iNSERTION applies just as 
in Sittard. However, like Standard Dutch, it has sa-INSERTION, and lacks UMLAUT. A rule 
which neither Sittard nor Standard Dutch have is vowEL Laxine, which in effect 
shortens the last vowel of the stem. 


VOWEL LAXING V 一 [— tense] / Cy + ka] pm 


1. Determine the order in which VOWEL LAXING, S-INSERTION and 3-INSERTION are applied, on the 
basis of the data below. 
2. Give derivations of items d, i and j. 


a. lap lapka ‘cloth’ 

kra:x kroxska ‘collar’ 
ko:nan ko:nanska ‘king’ 

slan slanaska ‘snake’ 
vlam vlaməkə ‘flame’ 

pan panəkə ‘pot’ 

dæ:m dæmkə ‘thumb’ 
brsam bisamka ‘broom’ 
snor snoraka ‘moustache’ 
oir 2rka ‘ear’ 


Serpe oan o 


7.5 _ CONCLUSION 


The comparison between Standard and Sittard Dutch diminutive formation, together 
with the data in the questions, has shown that different varieties of a language may 
have different underlying forms for what is functionally the same morpheme. Thus, 
the standard language has [tja], Sittard and Alphen have [ka], and Utrecht has [ti] as 
the UR for the diminutive suffix. Also, the sets of phonological rules they have may 
only be partly identical, as illustrated by the fact that Alphen and Sittard both have 
S-INSERTION, While Alphen has 9-INSERTION and vowel LAXING, which rules Sittard lacks, 
while conversely Sittard has uMLAuT, which is absent in Alphen. Varieties may also have 
the same rules, but apply them in different orders, as was the case with DEGEMINATION 
and PALATALIZATION in Standard Dutch and Sittard Dutch. Finally, varieties may also dif- 
fer in having the same rule, or what can be seen as the same rule, but have slightly dif- 
ferent versions of it, one having a wider SD (Structural Description) than the other. 
This was the case with 3-INSERTION, which applies after sonorants in the standard lan- 
guage, but after coronal sonorants in the dialect of Wehl. 
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These differences in a sense reflect the way in which phonological change is 
accounted for in a derivational grammar. First, rules may expand their SDs, and come 
to apply to wider sets of forms. Second, new rules may be added to the grammar, caus- 
ing the difference between the URs and the SRs to be increased. Third, rules may come 
to be reordered, causing different outputs to be produced. And fourth, URs may be 
changed, a process known as restructuring, causing the distance between the URs and 
the SRs to be decreased. These theoretical implications were investigated in Kiparsky 
(1968, 1988) and King (1969). In the model of Optimality Theory, phonological change 
must be understood as either of two things. First, it can result from the reranking of 
constraints, which in current OT are seen as universal. Second, it can be seen as restruc- 
turing, i.e. as the listing of a different UR, something known in OT as lexicon opti- 
mization. We will not work these comments out any further in this course, but see Holt 
2003. In the next chapter we discuss the question whether we need to recognize a level 
of representation between the fairly abstract UR and the phonologically detailed SR. 


Levels of representation 


8.1 INTRODUCTION 


In this chapter, we consider the question of how many phonological representations a 
word has. So far we have postulated two levels of representation, an underlying one 
and a surface one, a position that was motivated in Chapter 4. It can be argued, how- 
ever, that this two-level model does not account for the intuitions of native speakers 
about the pronunciation of the words of their language. In brief, the underlying repre- 
sentation would seem to be too abstract, while the surface representation appears to 
be too detailed (cf. Schane 1971). This unsatisfactory state of affairs has come to an 
end with the advent of Lexical Phonology, which theory postulates an intermediate 
level of representation known as the lexical representation. Not only does it corres- 
pond to native speaker intuitions, it will also be shown to have a number of interest- 
ing properties. After dealing with the lexical representation, we turn our attention to 
the relation between the surface representation and the physical pronunciation of the 
words. We will see that languages may differ in the way they realize identical surface 
representations. Such language-specific realizations of phonological elements are 
accounted for by rules of phonetic implementation. 


8.2 DEFINING AN INTERMEDIATE LEVEL OF REPRESENTATION 


If you look up the pronunciation of a word in a dictionary, you will find it is normally 
given in phonemic transcription. In this type of transcription, an English word like 
pin is transcribed [pin], not [p"in]. This is because the segment [ph] is an allophonic 
variant of the phoneme /p/, and as such has no place in a phonemic transcription. 
Clearly, the dictionary’s phonemic transcription defines a level of representation which 
is more abstract than the surface level. However, it does not correspond with the under- 
lying representation either. The pronunciation of looked, for instance, would be given as 
[lukt], not as [lukd]. The form that is given incorporates the output of a devoicing rule 
that makes obstruents voiceless after voiceless obstruents. That is, the phonemic tran- 
scription apparently corresponds to a level of representation which is somewhere 
between the underlying level and the surface level. 
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It might at first sight be reasonable to suppose that the intermediate level corres- 
ponds to the output of all rules that produce existing segments, which were called 
‘neutralization rules’ in Chapter 4, while all rules that produce novel segments, or allo- 
phonic rules’, would then apply to the intermediate representation so as to produce 
the more detailed surface representation. This would put all the rules that produce 
phonemes, or existing segments, in a different compartment of the grammar from all 
the rules that produce allophones, or novel segments. While this assumption is almost 
correct, there is still something not quite right. This is because there are rules whose 
output is a mix of existing and novel segments. Not all rules allow themselves to be 
characterized as either ‘neutralizing’ or ‘allophonic’. Some rules are both: depending 
on the input, they produce either an already existing segment or a novel segment. 
For instance, Dutch has a rule that voices plosives before [b,d], called REGRESSIVE 
VOICING. (It applies to all obstruents, as shown in Q56, but we will ignore the fricatives 
here.) The language has the three voiceless plosives [p,t,k], each of which can appear 
before /b,d/. REGRESSIVE voicinc thus produces the voiced plosives [b,d,g]. Dutch 
has oppositions between /p/ and /b/, as in [pak] ‘parcel’ - [bak] ‘tray’, and /t/ and /d/, 
as in [tak] ‘branch’ ~ [dak] ‘roof’, but there is no contrast /k/-/g/. Therefore, the 
output of REGRESSIVE VOICING is partly phonemic, viz. when /p,t/ are voiced to /b,d/, 
and partly allophonic, viz. when /k/ is voiced to [g]. In (1), the first column gives 
the underlying representations, while the third column gives the results of REGRESSIVE 
VOICING. 


(1) Underlying Phonemic Surface Gloss 
op dun ob dun [ob dun] ‘put on” 
ceyt bra:k ceyd bratk [œyd bra:k] ‘breakout’ 
zak buk zak buk [zag buk] ‘pocket book’ 


It should be clear that it would not be very clever to maintain a distinction between 
rules that produce existing segments and rules that produce novel segments, for this 
would mean that we would have to split REGRESSIVE VOICING, given in (2), into two sub- 
rules, one to produce the existing segments /b,d/, the ‘neutralizing’ part of the rule, 
shown in (3a), and a second to produce the novel [g], the ‘allophonic’ part, shown in 
(3b). (We formulate the rules so that they voice all obstruents.) 


—son 
(2) REGREssivE voicinc [—son] — [+voice] /__ (##) | —cont 
+voice 
—son —son 
(3) a. REGR. p,t-voICING [cor] | | > [+ voice] /__ (##) | —cont 
[LAB] 十 Voice 
—son 
b. REGR. K-voIcine Ee | 一 [+voice] / _ (##) | cont 
DORSAL +voice 
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Rule (2) converts the underlying forms in the first column in (1) into the forms in 
the third column, which seems just right. There is no reasonable case for maintaining 
an intermediate representation of the type illustrated in the second column in 
(1), which gives the intermediate representations that would result if we were to split 
(2) up into (3a), the neutralizing rule that produces /p,t/,and (3b), the allophonic rule 
that produces [g]. 

Halle (1959) used the above argument, which he made on the basis of a similar case 
of undesirable rule duplication in Russian, to argue that the notion of the 
‘Structuralist Phoneme’, which was a cornerstone of phonological theory as it existed 
before the advent of ‘Generative Phonology’, was misguided. American Structuralists — 
phonologists like Zellig S. Harris, Charles F. Hockett and W. Freeman Twaddell, whose 
work was superseded by the Generativists of the late 1950s and 1960s, like Morris 
Halle, Noam Chomsky and Paul Postal - held that the underlying representation was 
transferred into an intermediate, phonemic representation, which in its turn was 
transformed by rules into an allophonic surface representation. The notion 
phoneme is based on the existence of a surface contrast. The (entirely reasonable and 
valid) idea was that whenever a minimal pair could be formed, i.e. a pair of words 
whose members differ by one segment only, like English lock ~ rock, or beat ~ boot, the 
two segments responsible for the difference must belong to different phonemes. So the 
above two minimal pairs are evidence for the existence of the phoneme categories /|/ ~ 
Irl and /i:/ ~ /u:/ in English. What made their theory problematic, however, was the 
additional premiss that a segment could only belong to one phoneme. So once a /t/ had 
been set up in a language on the basis of some minimal pair in which the segment [t] 
contrasts with some other segment, any occurrence of [t] in any other word, regardless 
of context, also belonged to the phoneme /t/. This premiss has been referred to as ‘Once 
a phoneme, always a phoneme’. Adherence to this principle leads inevitably to the sort 
of undesirable splits in generalizations that we have seen in the case of Dutch REGRESSIVE 
VOICING. Since [d] belongs to the phoneme /d/, a conclusion based on minimal pairs in 
which it contrasts with [t] in the onset, [d] must also be assumed to represent /d/ when 
it occurs in the coda before a voiced plosive, as in the word for ‘breakout’ in (1). Since 
the same reasoning does not apply to [g], the position encapsulated in ‘Once a 
phoneme always a phoneme’ implies the recognition of an undesirable representational 
level illustrated in the second column of (1). 

So we are back at the beginning. It is still the case that dictionaries give phonemic 
transcriptions, and that untrained speakers of English will maintain that [ptin] and 
[spin] contain the same consonant [p]. It is reasonable to assume that their judgement 
is based on some (psychologically real) representation, here one that corresponds 
to a derivational stage before aspiration. But equally, they will maintain that the final 
consonant in looked is not the same as that in begged: in looked a [tj occurs, but in 
begged a [d]. This judgement is evidently based on a representation that arises after 
the application of pevoicine. Halle’s argument makes it clear that there is no place for 
the Structuralist phoneme in phonological theory. It does not, of course, rule out the 
existence of any intermediate representation. The answer to the question of what 


the appropriate intermediate level is has been provided by the theory of Lexical 
Phonology. 
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8.3 LEXICAL PHONOLOGY 


The crucial assumption made by Lexical Phonology is that some of the phonological 
generalizations of a language are stated in the lexicon, the morphological module 
which incorporates the semantic, phonological and morphological information of 
the language’s morphemes, while others are stated outside it. That is, a distinction is 
drawn between lexical phonological rules and postlexical phonological rules, the 
latter applying after the words have been inserted into the sentence (Kiparsky 1982; 
1985; Mohanan 1986). In (4), a number of the distinguishing properties are listed. 


(4) Lexical rules Postlexical rules 

a. May refer to morphological Cannot refer to morphological 
labels labels 

b. May have exceptions Cannot have exceptions 

c. Structure-preserving Need not be structure-preserving 

d. Accessible to native-speaker Not easily accessible to native- 
intuition speaker intuition 

e. Cannot apply across word May apply across word 
boundaries boundaries 

f. Must precede all postlexical rules Must follow all lexical rules 


We discuss each of these properties in the following sections. 


8.3.1 Reference to morphological labels 


Since lexical rules apply inside the lexicon and postlexical rules do not, the former, but 
not the latter, have access to category labels like ‘N(oun)’, ‘V(erb)’, etc. For example, 
there is a rule in Dutch that deletes word-final [n] after [9], as shown in (5). 


(5) Underlying Surface Gloss 
[[lo:p]y + an) ju lo:pa ‘to walk 
[[za:k]y + on] p) zaika ‘things’ 
[teikən]y teika ‘sign’ 
[[te:kon], + on] Inf te:kana ‘to draw’ 
lorpan] Aaj -  oIpa ‘open’ 


When [an] occurs finally in a verb stem, however, no deletion takes place (Koefoed 
1979; Trommelen and Zonneveld 1983). This is shown in (6). 


(6) Underlying Surface Gloss 
[terkan] te:kan ‘draw’ 
[o:pən]y o:pan ‘open’ 


N-DELETION (7) thus distinguishes between the words for ‘a sign’ and ‘to draw’, 
or between the adjective ‘open’ and the verb ‘to open’. Such a condition can only be 
put on a lexical rule. Once a word has left the lexicon and has been inserted in 
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syntactic structure, category labels are removed, and postlexical rules therefore 
cannot refer to them. 


(7) n-petetion «= n 9 O/ 9 __ 
Condition: X # V 


8.3.2 Exceptions 


Lexical rules, but not postlexical rules, have access to the lexicon, and as such can tell 
which word they are dealing with. A rule that has exceptions, therefore, cannot be a 
postlexical rule. To return to the example of n-DELETION (7) above: {'heidon] ‘heather’ 
and ['kriston] ‘Christian’ are exceptional in not undergoing the rule. The entries 
of these words are assumed to be provided with the information ‘Not subject to (7)’. 
Similarly, English has a rule of TRISYLLABIC LAXING, Which laxes a vowel in the antepenul- 
timate syllable of words derived with suffixes like -ity. Examples are given in (8). 
However, the words nicety and obesity (cf. nice, obese) exceptionally have [a1, i:] in the 
antepenultimate syllable, rather than the expected lax [1, el. 


(8) Tense Lax 
div[ai]ne div[I]nity 
v[ei|n v[æ]nity 
serit|ne ser[e]nity 


While postlexical rules cannot have exceptions, lexical rules could either have excep- 
tions or be exceptionless. For instance, English has a rule deleting [n] after [m] at the 
end of the word (Kiparsky 1985). This rule must be lexical, because it needs informa- 
tion about the status of the word before inflectional endings are added. Thus, it applies 
in column 1 in (9), and in column 2, where the words have been provided with inflec- 
tional endings, but not in column 3, where [n] is not final in the word. This lexical n- 
deletion rule is exceptionless: there are no words in English that end in [mn]. 


(9) Stem Inflected form Derived form 
dam[0] dam[Ø]ed, dam[Ø]ing, dam[0] dam[n]ation 
colum[Ø] colum[Ø]s, colum[Ø] colum[n]ar 
hyml[0] hym[Ø]s, hym[Ø] ing, hym[0] hymlnlal 


8.3.3 Structure preservation 


Lexical rules are structure-preserving in the sense'that their output is confined to seg- 
ments that already exist in underlying representations. The idea is that there is a lex- 
ical inventory of vowels, consonants and tones which is smaller than the inventory 
observable in surface representation. For example, since in the underlying representa- 
tion of English words there is no need to distinguish aspirated from unaspirated plo- 
sives, this distinction being allophonic in English, the rule that creates aspirated 
plosives must be postlexical. The segments [p", t", kh] are novel segments, i.e. not 
included in the English lexical segment inventory. 

Structure preservation is not an exceptionless property of lexical rules. A number 
of varieties of English have rules that apply before the affixation of inflectional 
endings and must for that reason be lexical, like the rule that deletes [n] in words 
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like autumn, discussed above. The point is that many of such rules produce novel 
segments (Harris 1994: 28). An example is the Scottish rule lengthening word-final 
vowels (as well as vowels before [十 voice, + cont] segments) (Aitken’s Law cf. Q40). 
Unexpectedly, the inflectional suffix [d] of the past and past participle is ignored in 
the context of the rule, which makes the rule a lexical rule. This is shown in (10). But 
the product of the rule is a novel segment. 


(10) Uninflected stem Past tense form 
fid feed fri:d freed 
taid tide taed tied 
said side saed sighed 
fud food su:d sued 


Interestingly, there is evidence that when a novel sound is produced by a lexical rule, it 
may be made available for inclusion in lexical representations. Thus, some speakers pro- 
nounce concise and scythe with [ae], even though in Scottish English these words end in 
[s] and [6], respectively (Aitken 1984). Clearly, developmental stages in which all occur- 
rences of a novel segment are produced by a lexical rule must be expected to occur, if it 
is assumed that at least some lexical rules historically start out as postlexical rules. 


8.3.4 Native-speaker intuitions 


Native speakers would appear to make reference to the lexical representation when 
determining whether two phonetically different sounds are the same sound or two 
different sounds: their judgements refer to the lexical segment inventory. For instance, 
native speakers of English regard the second segment in stop and the first segment 
of top as the same sound, even though they are phonetically different, which fits with 
the assumption that aspiration is a postlexical rule. Likewise, phonetically identical 
sounds that were neutralized by a postlexical rule will typically be looked upon as dif- 
ferent sounds. As a result of the American English process of FLAPPING, the intervocalic 
consonants in Adam and atom are phonetically identical in all styles except the most 
formal ones (['eram]), but native speakers nevertheless consider them different con- 
sonants. By contrast, when a lexical rule neutralizes an underlying opposition, the 
intuition of the native speaker tends to conform to its output. 


8.3.5 Application across word boundaries 


Because lexical rules apply in the lexicon, their structural description can never be 
determined by elements taken from different words. A rule that applies across word 
boundaries, therefore, must be a postlexical rule. Dutch REGRESSIVE vOICING (2) is a post- 
lexical rule for this reason. 


Q65 


1. What do you think is the lexical representation of ‘breakout’ in (1)? 
2. Do you think that Dutch REGRESSIVE voicinc has exceptions? i 
3. What would native speakers of Dutch say is the last consonant of the prefix in [œyd-bra:k]? 
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8.3.6 Lexical rules apply before postlexical rules 


The final distinguishing property listed in (4) is once more a necessary consequence 
of the lexical phonology model. Words get inserted into postlexical structures in their 
lexical representations, i.e. after all lexical rules have applied. It follows that if we know 
that a rule is postlexical, a rule that must apply after it must also be postlexical, and 
thus display all the postlexical properties listed in (4). 


8.4 PHONOLOGICAL INFORMATION IN THE LEXICON 


All phonological rules may refer to phonological information. The phonological infor- 
mation available in the lexicon is not confined to segments. Syllable and foot structure 
also exist in the lexicon (Booij 1988; Inkelas 1989). Evidence for this position is pro- 
vided by morphological processes that are sensitive to the syllable structure or 
the stress of the base. For example, the English comparative and superlative suffixes 
[[ lag; or] and [[ ]4z ast] require that the base should not exceed a binary foot. Therefore, | 
the formation is allowed with the adjectives in (11a), but not with those in (11b). 


(11) a. white (wait); (waitar)F 
noble (navubl)e (naUblsr)r 
silly (sili); (siliar) - 
b. beautiful (bju:tafl); *(bju:taflar)e 
manifest (mena), (fest); *(mænə);(festər)p 
serene Sa(riin)e *Sg(riinar)e 


An interesting illustration of the point that lexical rules may refer to both morpho- 
logical structure and phonological structure is provided by FINALDEVOICING in two vari- 
eties of German. In Low German, a term covering the varieties of German spoken in 
the northern half of the country, this process applies to syllable-final obstruents, as 
shown in the second column of (12) (Venneman 1972). The requirement that the 
obstruent should occur in the coda also holds for High German, but this variety 
requires in addition that in positions before sonorants the obstruent should be 
morpheme-final. As a result, the examples in (12b), in which the obstruent is both syl- 
lable- and stem-final, show devoicing in both varieties, but differences appear in the 
case of such words as Adler ‘eagle’, where an obstruent is syllable-final, but not stem- 
final, as shown in (12c).! 


(12) Underlying Low German High German Gloss 
a. kind kint kint ‘child’ 
kind-1f kin.dif kin.dif ‘childish’ 


"Rubach (1990) provides an account of the High German data based on resyllabification. Giegerich (1992) 
assumes that the varieties or styles with voiced plosives in words like Adler place the plosive in onset position. 
This solution implies that clusters that are ill-formed word-initially te [dl-, gm-}) are often well-formed word- 
internally: often the opposite occurs, since edges may allow ‘extra’ elements. A case in point is French [ps], 
which can be an onset in psychologie, but not in capsule (cf. Selkirk 1982.) 
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b.  kind-lig kint.lig kint.lig ‘childlike’ 
taig-li¢ te:k.lie te:k.lic ‘daily’ 

c. Vaign3R vatk.naR vaig.naR ‘Wagner’ 
magma: mak.ma: mag.ma: ‘magma’ 
aidlər ait.lor aid.lar ‘eagle’ 
ordn-un ?rt.nU ?rd.nUD order 


Q66 
The Korean lexical consonant inventory is given in (1). 
(1) Labial Cor Cor Dorsal 
[+ant] [-ant] 
p t tf 
pe Et i 
p f 


m 


Before [i], [t,t*,t’,s,s’,n,l] are prepalatal by a rule of PALATAUZATION: [c,c*,c’,f,J’,p,A], as shown in 
(2), causing surface syllables like *{si] to be ill-formed. In addition, there is a rule PLAIN PLOSIVE 
VOICING which voices [p,t,t{,k] between sonorant segments. 
(2) kaps-i kapfi ‘price + NOMINATIVE’ 

sikan figan ‘time’ 

kini kini ‘meal’ 

pthi peri ‘endure’ 


thi di ‘dust’ 

kaksi kakfi ‘bride’ 

mati maji ‘knot 
A rule of AFFRICATION Causes coronal plosives to become affricates before [i], if [i] forms part of a 
suffix. Thus, [t,t"] are replaced with [tf tf] in the contexts shown in (3a). ([t'] happens not to 
occur stem-finally). The affricates [t/, tft] also appear in underlying representation, as shown in 
(3b) (after Kiparsky 1993). 


(3) a. pat-i patfri ‘field + NOMINATIVE’ 
kut-i kud3i 中 arden 十 ADV 
kjyth-i kjytfi ‘side + NOMINATIVE’ 
. tfip tfip ‘house’ 
tfitfy tfidgy ‘tear (IMPp)’ 
tfxtf-i tfydzi ‘milk +NOMINATIVE’ 
IS PLAIN PLOSIVE VOICING a structure-preserving rule? 
Is AFFRICATION a structure-preserving rule? 
IS PALATALIZATION a Structure-preserving rule? 
Mention three properties of AFFRICATION that show it is a lexical rule. 
Mention one property of paLaTaization that shows it is a postlexical rule. 
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Q67 


Pee Ea erg et SES I OUNCE e ne sts See Cee 
French has rule of vowEL NASALIZATION, which nasalizes a vowel before a [n] in the same syllable 
(Tranel 1981). After VOWEL NASALIZATION, a rule of n-DELETION deletes [n] if it appears in the coda 
after a nasalized vowel. This is illustrated in (1), where the feminine forms were created by 
suffixing the stem with [ə]. (The nasal vowel [č] is the reflex of underlying [i,y,e]; please ignore 
the variation between [el and [€].) 


(1) Stem ; Fem. Derivation 
plen ‘full’ č plena plenityda ‘fullness’ 
bryn ‘brown’ brynə bryne ‘brownish’ 
fin ‘fine’ finə fines ‘fineness’ 
bən ‘good’ 5 bona bonifije ‘make good’ 
roman ‘Romance’ roma romana romanist ‘scholar of Romance’ 


There is a third rule, 3-DELETION, which deletes foot-final [a] except in the most formal styles, 
causing the feminine forms in (1) to be [plsn, bryn, fin, bon, roman] on the surface. 


Give formal notations of the three rules, referring to syllable and foot boundaries as in 
example (7) in section 6.3.1. 

Show the derivation of the masculine and feminine forms for ‘fine’. 

Before a consonant-initial noun, the feminine indefinite article is [yn], as in [yn fam] ‘a 
woman’; before a vowel-initial feminine word like [e.ro.in] ‘heroine’, the indefinite article 
[yn] is divided over two syllables, as in [y.ne.ro.in] ‘a heroine’. The masculine form of the 
indefinite article is [€] before consonant-initial nouns, as in [č gars3] and [En] before vowel- 
initial nouns, where the [n] is syllabified with the noun-initial vowel, as in {é.nom] ‘a man’. 
How would you account for the preservation of the [n] in the masculine indefinite article? 
Why must n-DELETION be postlexical? 

In some contexts, word-final [a] is preserved, as in [bada desine] ‘strip cartoon’. Is 
3-DELETION a lexical or a postlexical rule? 

Try and find arguments for the assumption that vowEL NASALIZATION is a lexical rule. 

There are many words like [5da] ‘wave’, which only ever have a nasalized vowel, never an 
oral vowel plus [n]. What would you assume as the underlying form of the word for 
‘wave’? Motivate your answer. 


8.5 CONTROVERSIAL PROPERTIES OF LEXICAL RULES 


Two further properties have been claimed to distinguish lexical from postlexical rules. 
However, they do not appear to have the same fairly unexceptional status as the prop- 
erties mentioned in (4), and have therefore been disputed (Halle and Kenstowicz 
1991; Kiparsky 1993). One of these is the property of cyclicity, the application of a 
rule to internal constituents of derived words in addition to the application to the 
derived word itself, which has been thought to be an exclusive and necessary property 
of lexical rules. It will briefly be illustrated in Chapter 14. The second is Non-DeriveD 
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ENVIRONMENT BLOCKING. This is the phenomenon that many lexical rules would appear 
to skip underived words, i.e. to apply only to forms that are derived. For instance, 
English TRISYLLABIC LAXING applies to derived forms like ['venati] vanity, [da'vinati] 
divinity, but not to underived forms like ivory [‘atvori] *['1vari], nightingale 
[‘naitingeil] *['n1tiygerl]. Accordingly, it has been proposed that lexical rules can only 
apply to derived forms (e.g. the Strict CYCLE Conpition of Mascaró 1976). However, 
many cases have been presented in which structure-preserving rules apply to all 
occurrences of a morpheme, derived or underived (e.g. Hyman 1970; Kenstowicz and 
Kisseberth 1979: 77). Kiparsky (1993) argues that NON-DERIVED ENVIRONMENT BLOCKING 
is actually a byproduct of underspecification. 


Q68 in Dutch, all obstruents are devoiced in the coda. As we saw in Chapter 4, this rule is 
responsible for the alternation between singular forms and plural forms like [pat] ‘toad 
(Sing)’ and [pada]. Does the rule show the effect of Non-Deriveo ENVIRONMENT BLOCKING? 


Q69 In Northern Irish English, all occurrences of [s:] can be derived from [ia] (Harris 1994). 
Some alternations are shown below. 


[iə] [e] [ia] [e] 


fate day station pay them 
made stayed same say more 
j rays fail daily 
playful 


Is the rule that produces [e:] a lexical rule? 

ls it a neutralization rule? 

ls it structure-preserving? 

Does it show the effect of NoN-DERIVED ENVIRONMENT BLOCKING? 


8.6 BEYOND THE SURFACE REPRESENTATION 


In summary, the phonological grammar advocated by Lexical Phonology can be 
schematized as in (13). 


(13) | UNDERLYING REPRESENTATION 


Lexical rules 


J 
LEXICAL REPRESENTATION 


Postlexical rules 


l 


SURFACE REPRESENTATION 
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The surface phonological representation which is produced by the grammar contains all 
the information the articulators require in order to do the work of pronouncing the 
linguistic expressions concerned. In other words, it is a fully specified phonological rep- 
resentation. Converting that cognitive representation into the physiological actions that 
constitute the articulation of the expression is the task of the phonetic implementa- 
tion rules. The diagram in (13) can therefore be completed as in (14). 


(14) UNDERLYING REPRESENTATION 


Lexical rules 


l 


Phonology 
LEXICAL REPRESENTATION 


Postlexical rules 


J 


SURFACE REPRESENTATION 


Phonetic implementation rules 
4 Phonetics 


ARTICULATORY PROGRAMME 


8.6.1 Phonetic implementation 


The phonological surface representation which ultimately results from the phono- 
logical grammar consists of some configuration of phonological features and structures. 
The representation is discrete, in the sense that any feature or structural configuration 
is there or is not there. The translation of this discrete representation into quantitative 
physiological activity, the articulatory actions discussed in Chapter 1, is taken care of 
by rules of phonetic implementation (Pierrehumbert 1980; 1990; Keating 1990). It 
used to be thought that such rules were universal, and that for that reason anything 
that was language-specific needed to be given a place in the language’s phonology. 
Everyday experience tells us that this is an improbable position. For one thing, the 
same phonological syllable is usually pronounced somewhat differently by speakers of 
different languages. For instance, [kis] will typically be pronounced with a somewhat 
more fronted [k] in French than in either Spanish or Dutch, while [s] will have lower 
frequency friction in Dutch than in either French or Spanish. Second, speakers of the 
same language may fine-tune the articulation of certain segments in order to signal 
the social group they belong to. As a result, we can often form a fair idea of the 
speaker's social background on the basis of the precise quality of their vowels, say. But 
even though speakers can fine-tune their articulations with great precision, postulat- 
ing ever more discrete phonological features to capture segmental differences which 
never appear to be used contrastively in any language would defeat the phonological 
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enterprise, which is to account for the fact that the phonologies of the world’s lan- 
guages are formally constrained and can be described with the help of a small, or at 
the least a finite, set of discrete elements. The detailed intra- and interlanguage vari- 
ation in the implementation of the phonological representation, therefore, is to be 
accounted for in the phonetic implementation. 


8.6.2 Models of implementation 


Ina widely adopted view, phonetic implementation involves the translation of a phono- 
logical feature into a target for the designated articulatory parameter. For instance, the 
feature [一 cont] triggers the formation of an oral closure at the location specified by 
the place features. A feature like [+ nasal] will trigger an opening of the velum for the 
duration of the segment specified for that feature, while [—nasal] will do the opposite. 
The different targets of the same articulatory parameter will be connected up by tran- 
sitions known as interpolations. Thus, the movement of the velum from a raised to 
a lowered position constitutes the interpolation between two phonologically specified 
target positions. This view of phonetic implementation implies two things. One, pho- 
netic implementation rules do not themselves add or remove phonological features: the 
only thing they can do is translate features into articulations. Second, segments that 
have no specification for a particular feature may be in the path of an interpolation 
between two segments that do. For instance, Cohn (1990) shows that the nasalization 
of the vowel in American English words like den, room, sung should be described in 
just that way. The nasalization starts early in the vowel and progressively increases to 
reach full velic opening at the end of the vowel. This is shown graphically in (15a), 
where the thick line represents the nasal airflow, or — indirectly ~ the degree to which 
the velum is lowered. Cohn argues that in (15a) the vowel has no specification for the 
feature [nasal], and that its nasality results from the interpolation between the 
[一 nasal] specification of the preceding consonant and the [+ nasal] specification of 
the following consonant. By contrast, in words like camp, dent, sink, the velum opens 
fully right at the start of the vowel, and closes fairly suddenly at the moment when the 
oral contact for the voiceless plosive is made. In fact, the nasal consonant disappears 
before a voiceless plosive in the coda, so that the nasality of the vowel can be seen as the 
transfer of that feature trom the deleted [m,n,n]. This is shown in (15b). (The voiceless 
plosive is accompanied by a glottal closure by an independent rule of GLOTTALIZATION.) 
The opposite pattern is found in words like mood, Ned, where the nasality ‘cline’ runs in 
the opposite direction, as shown in (15c). Thus, only in (15b) does the nasality of the 
vowel result from a [+ nasal] feature on the vowel, which is supplied by a phonological 
rule nasalizing vowels before a sequence of a tautosyllabic (i.e. occurring in the same 
syllable) nasal and plosive. 


(15) a. | | 是 b. | | ca | c. 7 | | | 
d e n d é tt n £ d 
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An alternative view holds that phonological features are translated into gestures. 
That is, instead of defining the beginning and end-points of articulatory movements, 
the elements in a gestural model are the articulatory movements themselves (Zsiga 
1997). Under this view, the phonological features in the syllable [den] are translated 
into a laryngeal gesture (voicing), two tongue tip gestures (one for [d] and one for [n]), 
a tongue body gesture (palatal wide, to produce the quality of [£], and a velic closing 
and a velic opening gesture. Their timings are governed first by the order of the seg- 
ments and second by the more detailed, language-specific instructions. If the opening 
gesture of the velum is initiated at the same time as the tongue tip closing gesture, there 
will be some nasalization during the last part of the vowel: the velum opens immedi- 
ately the gesture is started, while it will take the tongue tip some time actually to reach 
the alveolar ridge and, with the rims of the tongue, make the alveolar closure. In a lan- 
guage that allows a greater degree of nasalization, the velic opening gesture is extended 
so that it starts earlier. 

The developers of the gestural model had originally intended the gestures to 
be the phonological features encoding the pronunciation of words in the lexicon 
(Browman and Goldstein 1989). A problem with this very ‘phonetic’ view of phono- 
logical representations is that it makes it difficult to state the contrasts a language 
employs. It would, for instance, be impossible to say whether the nasalization 
during a vowel was contrastive or allophonic. A related assumption was that 
phonological adjustments arose from extending the duration of gestures, such that 
gestures come to overlap (as in the hypothetical case of the nasalization of prenasal 
vowels noted above). This, however, makes it impossible to distinguish between 
phonological, i.e. categorical effects (which typically cause gestures to disappear 
or be reordered) and gradient phonetic effects (which can be very adequately 
expressed by adjusting the durations of gestures). The gestural model is thus better 
suited for dealing with the phonetic implementation than for encoding underlying 
forms. 

Recently, phonetic implementation has been thought of less as the construction of 
an acoustic signal on the basis of gesture-based or target-based interpretations of 
phonological features, but more as the probabilistic selection of auditory representa- 
tions remembered by the speaker, known as ‘exemplars’. Speakers appear to modify 
the pronunciation of specific words on the basis of the pronunciation of other speak- 
ers who they recently heard pronounce those words. These word-by-word adjust- 
ments suggest that we store detailed phonetic information about individual words. A 
well-known argument for the fact that we store minimal information about a word’s 
phonology is that we cannot usually say what any word of our language, piano say, 
sounded like when we first heard it, and are typically unable to say whether the 
speaker was male or female, adult or child or what its pitch pattern or speech tempo 
was (Halle 1985). That may be true, but apparently we do remember the last time 
we heard it! Tacitly or overtly, we remember a great deal about speakers and they way 
they said things, and we may use that information during our own speech production. 
There is no conflict between the two facts, however, since one refers to our grammar, 
our knowledge of contrasts and the structures they are embedded in, while the 
other refers to how we pronounced them (Pierrehumbert 2002). Models of phonetic 
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implementation that are based on word-based phonetic knowledge are known as 
‘exemplar models’. 


8.6.3 Deciding between phonology and phonetic implementation 


It is not always obvious whether a particular generalization should be accounted for in 
the phonology or in the phonetic implementation. A reasonable assumption would be 
that if the output of the rule crucially feeds into another rule, the regularity must be 
phonological. An example may clarify this point. In many varieties of British English, 
including RP, there is a tendency to insert a voiceless plosive between a nasal and a fol- 
lowing fricative, if they belong to the same syllable. As a result, a word like sense is pro- 
nounced as if it was the plural of cent, i.e. as [sents] rather than [sens]. At first sight, 
there would appear to be two possible ways of accounting for this process. First, it 
might result from the details of phonetic implementation. For [n], there is an alveolar 
closure and the velum is lowered. To get to [s], the alveolar closure must be gradually 
released while the velum must be raised. The occurrence of the [t] will now depend on 
the precise timing of these two articulatory gestures: if the velum is raised before the 
alveolar closure is released, an oral stop will come to exist between the moment the 
velic closure is made and the alveolar closure is released. (The vocal cords will typi- 
cally stop vibrating once the velic closure is made, and the stop therefore is voiceless.) 
This entirely plausible account would not appear to be right in the case of RP. As it 
happens, this insertion of [t] creates the context for a phonological process which 
glottalizes voiceless stops in the coda of the syllable. If the insertion of the voiceless 
stop in sense is recorded in the phonological representation, we should expect it to 
trigger PREGLOTTALIZATION. The fact is that such inserted stops do trigger PREGLOTTALIZA- 
TION, just as underlying stops do, and voICELESS sToP INSERTION therefore must be a phono- 
logical process in RP. Incidentally, a preglottalized [t] also occurs in words like 
sensitive, concert, which is explained by the fact that [s] in this type of word belongs to 
the first as well as to the second syllable. (This type of ambisyllabic consonant will be 
examined in Chapter 10.) 


Q70 Would you expect sense and cents to be homophonous in British English? 


By contrast, the equivalent process in American English is an instance of phonetic 
implementation (Ohala 1986). The first syllable in a word like teamster or sensitive 
may well be followed by an intrusive voiceless stop, but it does not feed into the 
phonology of the language: these syllables appear to be longer than syllables closed by 
a voiceless stop would be in the same context. Also, words like sense and cents are dis- 
tinct. Cents has a categorically nasalized vowel, as well as a glottal stop ([s&?ts] (see 
(15b)), while sense has a partially nasal vowel, and no glottal stop, a pronunciation 
corresponding to (15a): [seén(t)s]). Thus, in an SPE-type description, the British 
English situation is captured by (16), while the American English situation is best 
described by some such instruction as in (17). 
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120 ce 一 


一 cont 
(16) BrE srop INSERTIONG —> | 一 voice | /a [... toas | 一 | son’) | ia 
QPLACE +cont 
QPLACE 


(17) AmE stop insertion (Implementation): When realizing an intra-word sequence 
[+nas]-[—son, +cont] before an unstressed syllable, time the velic closure for [s] 
just before or at the moment of the release of the alveolar closure for [n]. 


Q71 Before voiced obstruents, vowels are longer than before voiceless obstruents. 
Languages vary in the amount of lengthening before voiced obstruents. The difference in 
French ([vid] ‘empty’ vs [vit] ‘quickly’) is much smaller than in English ([bi:d] bead vs 
[bi:t] beat), for instance. How would you account for this lengthening before voiced 
obstruents? 


English [s] before [f], as in stocks shelves, trace shapes, often assimilates to a 
consonant that is indistinguishable from [f], but is also frequently pronounced 

as a fricative that gradually moves from [s] to [f] (Holst and Nolan 1995). How would 
you characterize these two situations in terms of the rule typology discussed in 

this chapter? 


8.7 CONCLUSION 


We have shown that Lexical Phonology provides a satisfactory answer to the problem 
of the definition of a level of representation between the underlying representation 
and the surface representation. Instead of drawing the dividing line between rules of 
neutralization and rules of allophony, it assumes that phonology exists in two separate 
components in the grammar. One part resides in the lexicon, where it can refer to 
information present in the lexicon, which is, first, morphological information, and 
second, the set of lexical phonological segments. The other part is outside the lexicon, 
where morphological information is no longer available and segments can be pro- 
duced that do not form part of the lexical segment inventory. A number of properties 
that can be associated with lexical rules contribute to the coherence and explanatory 
power of this distinction between lexical and postlexical rules. 

Postlexical rules must be distinguished not only from lexical rules but also from 
phonetic implementation rules. While postlexical rules are phonological rules, and 
can therefore only manipulate the phonological elements that representations consist 
of (features, segments, moras, etc.), phonetic implementation rules can only translate 
the phonological representation into actions of the articulators. The effect of phonetic 
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implementation rules may resemble the effect of phonological rules, and in practice it 
is not always easy to tell when a rule is phonological. 

It might be thought that if lexical rules can be sensitive to morphological informa- 
tion, postlexical rules will be able to refer to syntactic information. There is strong evi- 
dence against this assumption. Rather, it would appear that postlexical rules refer to 
representations that are part of a phonological constituent structure, known as the 
prosodic hierarchy. The theory of Prosodic Phonology, to be discussed in Chapter 
15, provides an answer to the question of what that constituent structure is and how it 
might be related to syntactic structure. 


Representing tone 


9.1 _INTRODUCTION 


As explained in Chapter 1, tone languages use pitch contrasts to keep words apart, in 
the same way that languages use vowel and consonant contrasts for this purpose. 
Mandarin, for instance, has four lexically contrastive pitch patterns. In (1a,b,c,d) these 
pitch patterns are illustrated with four words that minimally differ for tone,a minimal 
quadruplet’. In (1e) a fifth word is given, which has the same tone as (1d) but has a dif- 
ferent consonant, and so forms a minimal pair with (1d). Other languages that use tone 
contrasts for lexical purposes can be found in south-east Asia, Africa and the Americas. 


0 ce gee es: ae. 
a. baa b. baa c. baa d. baa e. maa 
‘eight’ ‘pull up (grass)’ ‘grasp’ ‘father’ ‘scold’ 


Many languages, like most Austronesian and Indo-European languages, do not use tone 
contrasts for the specification of their morphemes. Instead, tone patterns are independ- 
ently meaningful: they add a discourse meaning to the expression. Thus, in English, the 
same expression, Jonathan, sounds as an answer to a question in (2a), as a question in 
(2b), and as a reminder or a question in (2c). This use of pitch is known as intonation. 


a Na Le A 


a. Jonathan b. Jonathan c. Jonathan 


In this chapter, we will be concerned with the question of how pitch is represented in 
phonological structure. We will do this mainly on the basis of data from tone languages. 
We will see that the information about linguistic pitch is represented in terms of tones, 
and that tones, on the one hand, and vowels and consonants, on the other, are arranged 
on separate, parallel structural tiers. We will consider various arguments for this 
autosegmental status of tone. On the basis of data from various African languages, we 
will show that the tonal patterns of words can be insightfully described in a two-tier 
model. We will discuss the way the tones are associated with the other elements in the 
phonological representation on the basis of Goldsmith’s Association Convention and 
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the No Crossinc Constraint. A second argument will be based on the phenomenon that 
vowels may be deleted but their tone be preserved, a phenomenon known as 
stability. Finally, we will introduce and discuss the status of the OBLIGATORY Contour 
PRINCIPLE. 


9.2 THE INADEQUACY OF A LINEAR MODEL 


So far, we have assumed that segments form a single layer of structure, a single string 
of feature matrices. This linear conception of segmental structure, in which every seg- 
ment is a self-contained list of features, makes it impossible to represent aspects of 
pronunciation that characterize more than one segment as a single feature. However, 
languages frequently treat particular aspects of pronunciation, notably tone, nasality 
and tongue-body features, as if they belonged to whole sequences of segments. For 
instance, if a tone language were to have words whose syllables were either all high- 
toned or all low-toned, then it would make sense to say that each word of the language 
either had the feature [+hightone] (‘H’) or [—hightone] (‘L).In the SPE representation 
of feature matrices, we would not be able to express this generalization: every segment, 
or at least every vowel, would have to be specified for [hightone]. If, however, we were 
to represent the tonal features and the other features as two separate strings, we would 
be able to have different numbers of entities in the two strings, which would allow us to 
say that a word consisted of six segments, say, but of only one tone. We show the differ- 
ence for a hypothetical high-toned word [ta ta] in (3): (3a) gives the linear SPE situa- 
tion (where [a] stands for the rest of the distinctive features in that matrix) and (3b), or 
equivalently (3c), gives the representation with two parallel strings. 


(a | ot t b. tata é* tata 


[+hightone] H 
Goldsmith (1976) characterized the SPE position as a theory that adheres to the 
ABSOLUTE SLICING Hyporuesis: the phonological representation of a word is given by mak- 
ing a number of clean cuts along its time axis, each slice so produced being a segment. 
The representation which he argued should be adopted instead is one in which the 
cuts for different features may be made in different places. Because this procedure will 
lead to one or more strings of features in addition to the string of (depleted) feature 
matrices, he used the term autosegment to refer to any feature that had come to be 


removed from the others, while the separate strings of autosegments and depleted fea- 
ture matrices are known as tiers. 


a a 
十 hightone| "| +hightone 


9.3 WORD-BASED TONE PATTERNS 


If, regardless of the number of syllables, the words of a language were either entirely high- 
toned or entirely low-toned, an imaginary situation briefly considered in the previous 
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section, it would obviously be economical to specify each word once either for low tone 
or for high tone. Put differently, we would wish to say that the language had two tone 
melodies, H and L, and that each word had either the one or the other. A real, somewhat 
more complex case is presented by Etung. Each syllable in an Etung word has high, low, 
falling or rising tone. Importantly, however, there are certain constraints on the distribu- 
tion of the tones: not all tones can appear on all syllables. The symbols used in the tran- 
scription of tone are given in the first column of (4), and the tonal representations in the 
third. 


(4) ' high tone H 
‘low tone L 
falling tone HL 
4 rising tone LH 
falling-rising tone HLH 
y rising-falling tone LHL 


In (5a,b,c), we list actual words of one syllable, two syllables and three syllables, respect- 
ively. Notice that level tones (i.e. high tone and low tone) may occur on all syllables, but 
that contour tones (falling tone and rising tone) only appear on monosyllables and on 
the final syllable of disyllabic words. Moreover, if a syllable has the same tone as the pre- 
ceding syllable, all following syllables have that tone. Thus, the words in (5a,b,c) are all 
fine, while those in (5d,e,f ) are all bad. In (5d), we see that contour tones must be final, 
in (5e) we see that contour tones cannot appear in words of more than two syllables, 
while in (5f) we see that in words of three syllables, the first two cannot have the same 
tone if the third has a different tone. 


(5) a. monosyllabic words kpa, kpé, na nd 
b. disyllabic words dba, èkát, òbô, dda, abd, 
c. trisyllabic words 全 de okpuga bisdné, édimbá 
d. disyllabic sb "ibd í 
e. trisyllabic *4bimbâ, *ábimbà, “abimba, *àbimbă, . . . 
f. trisyllabic *4bimba, *àbimbá 


Leben (1973) argued that such distributional facts can be captured by assuming that 
tones are not chosen per syllable, but per word, and that, moreover, these tone patterns 
are independent of the vowels and consonants. These Etung tone patterns are given in 
(6). Importantly, tonal tier and segmental tier are unassociated in the lexicon: the point 
is that all and only the well-formed lexical patterns of (5) can be derived by an associ- 
ation algorithm, so that we account for the ungrammaticality of the forms in (5d,e,f). 
To achieve this, Goldsmith (1976) provided the Association Convention (7). The timing 
of an autosegment with the rest of the representation is expressed with the help of an 
association line. The phonological element with which the tone associates is known as 
the tone-bearing unit, or TBU. It is usually either the syllable or the vowel (whereby 
long vowels count for two vowels). 
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(6) A word has one of the six patterns L, H, LH, HL, LHL, HLH 
(7) ASSOCIATION CONVENTION (AC) 

a. Associate tones and TBUs left-to-right, one-to-one 

b. Associate left-over TBUs with the last tone 

c. Associate left-over tones with the last TBU 


In (8) a number of examples have been worked out. In (8a) the number of tones and 
the number of TBUs is equal, and one-to-one association is all we need to do (clause 
(7a)). In (8b,c), there are fewer tones than there are syllables. According to clause (7b), 
we must now associate the last tone with the left-over syllable(s), an operation known 
as spreading. Notice that this causes the same tone to be associated with more than 
one TBU. In (8d), there are more tones than there are TBUs. The last TBU receives the 
left-over tone (clause (7c)), causing this TBU to be associated with a contour tone. 


(8) Lexical representations 


a. edimba b. bisone c. ekue d. obo 
HLH LH H LHL 
(7a) 
edimba bisone ekue obo 
boli | | | | 
HL H LH H LHL 
(7b) 
bisone ekue 
| 
LH H 
(7c) 
obo 
| \ 
i ‘ LHL 
édimbá bisóņé ékué òbô 
‘pot’ ‘wife’ ‘forest’ ‘arm’ 


Etung has no falling-rising or rising-falling words of the type *na or *na. Apparently, 
there is a limit to the number of tones that can associate with a TBU in this language. 
This No Crowning Constraint, which appears to be quite common in languages gener- 
ally, is given in (9). 


(9) NO CROWDING CONSTRAINT * TBU 


FTF 
The above description accounts elegantly for the distribution of the four surface tone 
types (high, low, falling, rising) of Etung. If we were to elevate those surface tone types 
to features inside a segmental feature matrix, a description would result that would 
consist of a series of arbitrary facts. In (10), this alternative is made explicit. It should 
be compared to the autosegmental alternative consisting of (6), (7) and (9). 


(10) a. A syllable has one of the four tones High, Low, Fall, Rise. 
b. Fall and Rise never occur on a nonfinal syllable. 
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c. Words of more than two syllables never have a Fall or a Rise. 
d. Words of more than two syllables never have the same tone on the 
first n syllables if the nth+1 syllable has a different tone. 


The TBU may be defined differently for different languages. In some languages the 
syllable is the TBU, so that long vowels count as one TBU. In addition to vowels, sono- 
rant consonants in the syllable rhyme may be TBUs. In the next chapter, we will intro- 
duce the notion ‘mora’, which will enable us to say that the most common TBU is the 
sonorant mora (Hyman 1992). 


9.3.1 Language-specific associations 


The Association Convention (7) is not an obligatory procedure in tone languages. For one 
thing, there may be language-particular rules which override it. In Kikuyu, the associa- 
tion proceeds from the second TBU in the word, rather than the first. The word in (12), 
which means ‘way of releasing oneself quickly’, is made up of the morphemes in (11). 
The verb root extension [ang] ‘quickly’ is toneless. The association of the tones con- 
tributed by the various morphemes proceeds as predicted by (7), with the last H spread- 
ing to the left-over TBUs, except that instead of associating to the first TBU, the first tone 
associates with the second TBU (12a). Subsequently, the first empty TBU is associated 
with the first tone, as shown in (12b) (Clements and Ford 1979). An interrupted associ- 
ation line represents an instruction to establish the association concerned. 


(11) mo e rek+ang erie, 
L H L H 
CLASS PREFIX REFLEXIVE allow+ quickly noun surrix 


(12) mo e re ka pge rie 


L L H 


Q73 The Kikuyu words for ‘firewood’ and ‘tree’ are [rok6] and [moto]. 


What are the underlying tone patterns of these words? 

In Tharaka, a related Bantu language spoken in Kenya, these words have the same 
underlying tone patterns, but are realized as [rok6] and fmot6]. How would you describe 
this difference between Kikuyu and Tharaka? 

The Tharaka word for ‘way of releasing oneself quickly’ is made up of the same morphemes 
and the same tones as in (11). How will it be realized in Tharaka? Provide the derivation. 


Thus, languages may have specific constraints on associations of particular tones. For 
instance, Hyman and Ngunga (1994) show that the TBU which the (first) tone associates 
with may be determined by the morphological class, as it is in the verb forms of Ciyao. 
Even in languages like Etung, whose word tonal patterns are evidently governed by the 
ASSOCIATION CONVENTION (7), there may be exceptional patterns that cannot so be accounted 
for. Likewise, there is no universal requirement that tones should always spread to empty 
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TBUs. Pulleyblank (1986) shows that in Tiv, noninitial TBUs may remain unspecified for 
tone until a point in the derivation when they receive a default L. Neither need it be the 
case that left-over tones always form contours on the last TBU. Languages may not tolerate 
contouring at all, obeying an even more restrictive constraint than (9). 


Q74 Zulu (Laughren 1984) obeys a more restrictive constraint than the No CROWDING CONSTRAINT 
(9), since no contour tones may appear on short vowels at all. In addition, Zulu words 
behave either like Kikuyu words or like Etung words. That is, every word is lexically 
specified as to whether association starts with the first or the second TBU. In Natal Coast 
Zulu, the word for ‘chiefs’ consists of two morphemes, /ama/ and /kosi/, both of which 


have underlying HL. The first morpheme is specified for association to start with the first 
TBU, the second for association to start with the second TBU. After this initial association, 
the rest of the derivation is taken care of by the Association Convention in (7). 


1. How will the word [amakosi ] be realized? 
2. Why is the realization not [amakosi]? 


While the Association Convention in (7) may be seen as representing an unmarked pro- 
cedure, evidently languages may decide to introduce language-particular instructions 
or choose to have representations that deviate from its predictions. The only uncon- 
troversially universal aspect to autosegmental representations is the No Crossinc 
Constraint, given in (13). It is an inalienable feature of the autosegmental representa- 
tion: if it did not hold, the order of the autosegments on different tiers could change 
relative to the orders on other tiers. The constraint in (13) is there to prevent such 
order changes. 


(13) ”No Crossinc CONSTRAINT: Association lines do not cross. 


In the next section we give a second argument for the autosegmental status of tone. 


9.4 STABILITY 


There is another argument .for the autosegmental representation of tone, which 
Goldsmith (1976) refers to as the stability of tones. Sometimes, a vowel is deleted, but 
the tone it had remains, showing up on an adjacent vowel. In (14) some examples are 
given from Etsako, in which a nonhigh vowel is deleted before another vowel 
(Elimelech 1976). When the final vowel of the stem is deleted, its tone shows up on the 
next vowel, where it creates a contour. If the tone of the deleted vowel is the same as 
the tone on the following vowel, however, nothing happens. In Etsako, reduplication of 
the noun has the meaning ‘every’. 


(14) ikpa ‘cup’ ikpikpa ‘every cup’ 
ówà ‘house dwowa ‘every house’ 
dyédé ‘banana’ Jyèdôyèdé ‘every banana 
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It is of course possible to describe these facts in a theory in which all features are con- 
tained in segment-size matrices. We could, for example, assume the feature analysis 
of the low, high, falling and rising tones in (15) and write (16) to account for the tone 
changes. In prose, (16) says that a word-final vowel is deleted before a word-initial 
vowel, and if it has a different tone from the word-initial vowel of the next word, the 
tone on the word-initial vowel becomes rising if it was high and falling if it was low. 
While it correctly describes the situation, it presents the facts as arbitrary: we could 
just as easily write a rule with [—contour] rather than [+contour], but that rule would 
not describe any actual language data. 


(15) 和 
a 
l 1 
(16) Mahe ee . eee ae +92 <[+contour]> 


The example shows that the tone and the vowel must not form a single segment, for other- 
wise the deletion of the vowel would imply the deletion of the tone. The autosegmental 
representation allows us to express the fact that it is only the vowel, not its tone, that is 
deleted. By reassociating the stranded tone to the next vowel, we accurately express the 
tonal adjustments. In (18b) a rise is produced, in (18c) a fall, while in (18a) no tonal 
change is predicted. Associating the same feature more than once to a structural element 
is meaningless, and the two identical tones are automatically reduced to one by a conven- 
tion known as the Twin Sister Convention (see (17) Clements and Keyser 1983). 


(17) TWIN SISTER CONVENTION V V 
八 > | 
T g Ti 
(18) a. ikpa ikp a b. owa owa c. xede syede 
a a Et | ie ae al 
k-ta Eek HL HL LLH LLH 
ikp ikp a ow owa oved yede 
| [on] Hl | 
L i Li iL HL CHL LE H LH 
0 (by 17) 


9.5 POSTLEXICAL TONE 


Not all tones are lexical. Intonation languages only provide tonal representations 
postlexically. These tones are independent morphemes, whose meanings are somewhat 
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vague, having to do with whether you are telling the listener some information or are 
asking for some, but no attempt to define them will be made here. A distinction can be 
made between tones that are associated with the primary stressed syllables of words 
and tones that are located at the edges of larger domains. The former are pitch accents, 
while the latter are known as boundary tones. Like word melodies, pitch accents may 
consist of one or more tones, but only one of them associates with the stressed syllable. 
(Pitch accents may form single morphemes with boundary tones.) In (19), an example 
is given of an English sentence with an intitial boundary L and a final boundary H, 
which has a pitch accent H*L on the primary stress of Jonathan. The tone followed by * 
is the tone that associates with the primary-stressed syllable; the other tone of the pitch 
accent is pronounced closely after the associated tone. The boundary tones are realized 
on the initial and final syllables, respectively. When Jonathan is said in isolation with 
the same intonation pattern, the final H will obviously be realized on the syllable -than 
(cf. (2c)). 

And if the main stress in the sentence were to be moved to been, the pitch accent 
H*L would associate with that syllable, causing the pitch peak to be located over been. 
See further Ladd (1996), Gussenhoven (2004). 


(19) ee 


[ has Jonathan been yet ] 


L H*L H 


Q75 Ifthe intonation pattern used in (19) of the text were to be used on the sentence 
Has John?, where the main stress is on John, what would be the pitch pattern on 


the syllable John? 


9.6 THE OBLIGATORY CONTOUR PRINCIPLE 


The success of the autosegmental account of the word tone patterns in languages like 
Etung depends in part on the decision not to represent the tone of each syllable as a 
separate tone in the representation. Crucially, adjacent syllables with the same tone 
need not each be specified separately for tone, since the same tone can associate with 
more syllables through spreading. Considerations like these have led to the assump- 
tion that identical tones cannot be adjacent in the same word. This restriction on rep- 
resentations goes by the name of the OBLIGATORY Contour PRINCIPIE (OCP), formulated 
by Goldsmith (1976) with reference to Leben (1973), which forbids the occurrence of 
identical adjacent tones in the same morpheme. 


(20) OBLIGATORY Contour PRINCIPLE: H y OW 
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Another argument for the OCP is that rules changing or deleting tones usually delete 
whole strings of what on the surface appear to be sequences of identical tones. If we 
assume that the string of identical surface tones is in reality a single, multiply associ- 
ated tone, a ready explanation is found for this phenomenon. An example, from Odden 
(1986), comes from Shona. The ‘associative prefix’ [né] ‘with’ lowers any consecutive 
H-tones to low. Clearly, if these words have only one H, rule (22) can change that H into 
L, thus accounting for these data, as shown in (23). 


(21) mbwá ‘dog’ 
hóvé ‘fish’ 
mbúndúdzí ‘army worms 
ne 
(22) H-L/ | 
ne 


(23) | ne-mbundudzi— ne-mbundudzi 


bale bo “Ge 
H H H í 


By contrast, if each of the three syllables of ‘army worms’ were to have-its own H, a 
more complicated analysis results. We must apply a rule deleting H after [né] provided 
the following tone is also H (which rule will apply more than once to the same word in 
a case like ‘army worms’), then change the remaining H-tone into L, and third, spread 
that L back to the preceding toneless syllables. Clearly, having a single H-tone in the 
representation makes life a lot simpler! 


Q76 How does the derivation of ‘every cup’ proceed if, as argued by Leben (1978), 
Etsako obeys the OCP? Would there still be a need for the operation of the Twin SISTER 


CONVENTION? 


9.6.1 Violating the OCP 


The OCP cannot be a universally obeyed constraint. For one thing, languages like 
Etung frequently have exceptional patterns. For instance, words with surface melodies 
HHL and LLH do exist, like [ gare ] ‘pepper’. Such words must either be assumed to 
have underlying melodies like HHL, contra the OCP, or to have a single H tone pre- 
linked in the lexicon contra the AC. In the latter option, the SR of ‘pepper would look 
like (24) (Odden 1986). 


(24) ngare 
V | 
H L 


While prelinking of tones would appear to be able to save the OCP in Etung, represen- 
tations that violate the OCP do appear to be necessary in some languages. 
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One indication for the separate representation of adjacent identical tones in 
a word is their realization: if they are pronounced at different pitch heights, it is 
reasonable to assume that each pitch target results from a separate tone. A case 
in point is found in Dinka (Andersen 1987). Dinka syllables may have L, H or the 
contour HL. Examples are [tl:k] ‘woman’, [ban] ‘chief’, [tn] ‘horns’. Like many 
tone languages, Dinka has a phonetic implementation rule of downstep which causes 
a H after a L to be lowered relative to an earlier H. Any following H has the same 
pitch as the preceding (lowered) H, and if there is another L, a further downstep 
of a following H will take place. In (25), the word [I3::k] is pronounced with down- 
stepped H because of the preceding L. However, downstep also occurs on the phrase- 
final syllable with H tone after a H tone. The pitch of the final H in (25) is just 
what it would have been if L had preceded it. The bisyllabic word for ‘here’, therefore, 
must be assumed to have two separate Hs. In the transcription, downstepped tones 
have been marked with !. The numerical transcription of the pitch contour which 
results from the phonological representation uses 0 for the highest pitch, 1 for the 
next highest pitch, etc. The difference between H and L is 2, and downstep lowers the 
pitch by 1. 


(25) c-ametlo:k niin etgin 
AAN De lal 
H L'H H H!H 
=Z H.-F 2 


‘when did you wash the child here?’ 
perF-2sg child wash 二 PARTICIPLE when here 


As we have seen in section 9.5, languages frequently have boundary tones, and the 
downstep on the final H of Dinka is possibly caused by the interaction between a final 
L-boundary tone and the H on the final syllable. This is shown in (26): an implemen- 
tation rule will realize the final HL after H as a downstepped H. 


(26) 


There are further facts that suggest that sequences of identical tones must be allowed 
to occur in Dinka. L-tones can be downstepped, too. L-tones are downstepped after L, 
provided no H-tone follows in the same phrase. This is shown for a sentence with six 
consecutive L tones, after an H-tone in (27) and in a context without any H-tone 
in (28). The two syllables of the words [tà:ņ-dù] ‘your spear’ in (27) and [an50] 


'There is a possible way round the conclusion that disyllabic high-toned words have two consecutive Hs. It is 
possible to assume that they have HL on the first syllable and H on the second. This is because the contrast 
between the HL contour and H is neutralized in positions before H. The first syllable is pronounced just like H, 
as if L was deleted, while the second is pronounced high in pre-phrase-final position and downstepped H in 
phrase-final position. Thus, in the more elaborate representation [HL H], the OCP is not violated. 
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‘per-have’ in (28) show the same pattern as any two L-toned syllables separated by a 
word boundary, and each therefore must have its own L. 


(27) rjg::m 3a-t9 etaa-du tok 
下 
H. BILT {LIL 
Oh nd ,6 
‘there is blood on your spear 
blood DEF-be on spear-2sg mouth 


(28) anon lec e cwair 
Fak il 
EIL LALIE 
‘he has the teeth of a lior 
DEE-have teeth of thief+ERGATIVE 


The downstepping pattern contrasts with a level contour before a H-tone. In (29), a 
word with H-tones has been added to the L-toned sentence, and here no downstep- 
ping occurs in the low-toned words, not even for the L-tones that are far removed from 
the H. This suggests that the whole of the pitch contour before the first !H is the real- 
ization of a single L-tone. The example shows that the difference between a represen- 
tation with a single L and one with multiple Ls is meaningful: it distinguishes between 
a level low stretch and a contour that descends syllable by syllable. In order to account 
for the representation in (29), we must assume a postlexical rule merging adjacent 
L-tones before H. 


(29) anon lec e cwatr etezin 
| | 
L 'H!H 
22 2 gt Wig TINS 

‘he has the teeth of a lion here’ 


Q77 The majority of Mende words have the tone patterns illustrated below: 

one syllable: ká ‘war’, kpa ‘debt’, mbû ‘owl’, mba ‘rice’, mba ‘companion’ 

two syllables: pélé ‘house’, njaha ‘woman’, ngila ‘dog’, fandé ‘cotton’, bèlè ‘trousers’ 
three syllables: kpakali ‘tripod chair’, félama ‘junction’, ndàvúlá ‘sling’, nikili ‘groundnut’, 


hawané ‘waistline’ 
1. What are the underlying tone patterns of Mende? 


2. Does Mende, like Etung, have constraints on contouring? 
3. Give the tonal representations of the words for ‘woman’ and ‘tripod chair’. 
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Q78 In Kipare, an utterance-final H is downstepped, as shown in (1a) (Odden 1986). 


(1) a. ipa ‘nga ‘machete’ 
b. ipangalédi ‘good machete’ 


Assume that the end of the utterance has a boundary L, which causes the downstep on the final H. 

When L-tones precede the final H, a frequent, though not obligatory, pronunciation is one in 
which the final (downstepped) H spreads leftward to all the L-toned TBUs. (2a) shows the 
pronunciation without the spreading, (2b) the pronunciation with. 


(2) a. nifinikiré mayémbé màè'ðá 4 covered long hoes’ 
b. _ nifi'nikiré mayémbé máéðá 


Although it is optional, the spreading of the final H shown in (2b) is always to all possible TBUs. 
That is, it is not possible to pronounce the sentence with spreading of H to only some of the 
words that are pronounced with low pitch in (2a). What does this suggest about the 
representation of the L-toned words before H spreads? 


Q79 West Greenlandic is a nontonal language, which has a phrase-final boundary melody 
HL and an utterance-final H. At the utterance end, these tones come together to form 
the melody HLH. The timing of these boundary tones varies, as illustrated in the 
following data (Rischel 1974; Nagano-Madsen 1993). Assume that the preceding 
unmarked syllables are unspecified for tone, and are pronounced with middish pitch. 


. akivara ‘| answered him’ 
. ata:siq ‘one’ 
aki va: ‘he answered him’ 


. Uvana sg 
uvanalé ‘and I’ 
uvanatsa: ‘I, too’ 


eatin that the TBU in West Greenlandic is the vowel, with long vowels and diphthongs 
counting as two TBUs, show the structure of the expressions in a, b, c. Show long vowels 
with double symbols (i.e. [a:] as [aal). 

2. State the direction of association. 


Yes-no questions end in the same melody, but are distinguished from Declaratives by the 
lengthening of the final syllable. In particular, short vowels become long, long vowels become 
overlong (i.e. count as three TBUS), and diphthongs have a first element corresponding to a 
long vowel. This is shown in the following data: 


Declarative Interrogative 

g. taku“a:na ‘he saw me’ taku” &na: ‘did he see me?’ 

h. takú"iúk ‘you saw him’ taku" iik ‘did you see him?’ 

i, tsigda: ‘he takes that’ tsigu"a: ‘did he take that?’ 

j. apirai ‘he asked them’ apira:i ‘did he ask them?’ 
Are these data compatible with your account of the association of HLH? Give the 
autosegmental representations of the Declarative and Interrogative forms of ‘he asked them’. 
The penultimate vowel of the Interrogative form in (g) is fully high-toned. What extra 
assumption must be made to account for this fact? 


134 Representing tone 


9.7 CONCLUSION 


Goldsmith's thesis (1976) represented a breakthrough in our conception of phono- 
logical representation. While the idea that speech is to be represented as a number of 
parallel tiers of elements was not new, and had in particular been advocated by a group 
of British phonologists who worked in association with J.R. Firth (e.g. Palmer 1970), 
the intellectual environment in which Goldsmith did his research ensured that its 
impact went well beyond the question of how to represent tone. Autosegmental 
phonology soon came to be applied to segmental duration, as we will see in the next 
chapter, as well as to phonological features like [nasal] and [+ ATR], and eventually 
led to what is now known as Feature Geometry (chapters 11 and 12). At about the 
same time, beginning with the work of Liberman and Prince (1977) (chapter 13), the 
representation of stress underwent an evolution which, together with the renewed 
appreciation of the role of syllabic structure (Kahn 1976; see Chapter 10), led to the 
introduction of prosodic structure in phonology and ultimately to what we now know 
as Prosodic Phonology (Chapter 15). The SPE model of phonological representation, 
with its strict sequential arrangement of segments and boundaries, came to be known 
as a linear model, while the new conceptions came to be collectively referred to as 
nonlinear phonology. 
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10.1 INTRODUCTION 


In this chapter we will consider the way in which segments are organized into syllables. 
The syllable dominates the segments only indirectly. That is, between the syllable and 
the segment a further level of structure is commonly assumed. One view of this inter- 
mediate level of structure is that of the CV tier. Two of the roles this tier is required to 
fulfil are first, the representation of segmental duration and second, the designation of 
the syllabicity of the segment, i.e. whether it occurs in the peak or, alternatively, in the 
onset or coda. In a sense, then, the topic of Chapter 9 is continued, but instead of tone 
we now consider segmental duration as a candidate for autosegmental representation. 
The arguments for the CV tier will be reviewed and illustrated in section 10.4. There is, 
however, an alternative representation for these aspects of phonological structure. One 
reason for including the syllable in phonological structure is that it forms the domain 
to which stress is assigned: it is the structural element that can be either stressed or 
unstressed. In many languages the ‘stressability’ of a syllable appears to be influenced 
by the contents of its rhyme: broadly speaking, single (short) vowels behave differently 
from other syllables. Accordingly, the mora is taken to be the intermediate level of 
structure that allows the distinction between such light (or monomoraic) and heavy 
(or bimoraic) syllables to be expressed. It is generally seen as a representation that 
makes the CV tier superfluous, the idea being that all the roles that have been claimed 
for it can be fulfilled by other aspects in the phonological representation. 

In addition to the behaviour of stress and the sequential restrictions that were 
referred to in Chapter 2, where *[ætk] was said to be an impossible syllable of English, 
the evidence for the syllable as a phonological constituent comes from the fact that it 
must be referred to by many phonological generalizations. Examples of such syllable- 
based generalizations are given in section 10.6. Finally, we consider the need for the 
ambisyllabic representation of certain consonants. 


10.2 A SKELETAL TIER 


In the previous chapter we have seen that tone is best represented as an independent 
string of segments. Another aspect of the pronunciation of words that appears to 
behave independently of the segments is segmental duration. A particularly clear 
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example is provided by language games. One of these, called Ludikya, is reported by 
Clements (1986) for Luganda, and involves a reversal of the order of the syllables in 
the word. Luganda has both a vowel length contrast and a contrast between long and 
short consonants. Instead of by the length mark, we indicate long segments by dou- 
bling the symbol. The striking thing is that although the vowels and consonants move 
round in this game, the durational structure remains intact, as shown in (1). 


(1) Luganda Ludikya 
mukono nokomu ‘arm’ 
mubinikilo lokinibimu ‘funnel’ 
baana naaba children 
zuba bbaju ‘dove’ 
kiwoyolo lojowwoki ‘butterfly’ 
kubajja jabakku ‘to work in wood’ 


In order to be able to describe this process, the mutual independence of segmental 
duration and segmental quality needs to be expressed in the representation. Clements 
and Keyser (1983) propose that segments are not immediately associated to their 
syllables, but are dominated by structural positions, known as (skeletal) slots, which 
encode the segment durations. Consonants and vowels that are associated to single 
slots are short, while long vowels and geminate consonants are represented as being 
doubly linked to two slots. Additionally, they assume that there are two types, a C and 
a V, where the C represents a syllable margin (onset or coda) and V a syllable peak. 
That is, CV slots take over the role of the feature [+ syllabic], in addition to providing 
a representation for segmental duration. 

The CV slots are dominated by the syllable nodes, each syllable being represented by 
o. For example, a language with a duration contrast for vowels as well as for consonants, 
like the Dravidian languages Malayalam and Tamil, would have representations like those 
in (2), which represent the Tamil words [patu, pa:tu, paț:u, pa:t:u], respectively (Firth 
1957, Mohanan 1986: 108). All four words consist of two syllables each and have the same 
string of segments; however, they differ in their segmental timing structure. 


(2) o o o o 


AA oE ara s ge tak pe, “bed 


a. CVCV b CVVCV 1G Nae GN. dh CMV. GEN 
ITI | Lat ad likl IV VI 
patu pa tu patu P a rlu 
‘to endure’ ‘to sing ‘enduring’ ‘a song 


The CV representation allows us to characterize the Luganda word game Ludikya 
illustrated in (1). It involves a reversal of the phonological content of the syllables of 
each word, with the retention of the structure above the segments. This is shown in 
(3), where the top line gives the order of the segments in the Luganda word, while the 
bottom line gives the Ludikya version. Importantly, the strings of skeletal slots are 
identical in the two versions of each word. 
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(3) bana kuba } a 
IAI | LIIIAI 
CVVCV CVCVCCV 
IV 11 ll 
naba aba ku 


In addition to providing a representation for the duration and syllabicity of segments, 
the CV tier has been motivated on the basis of other phenomena. These arguments 
involve the representation of morphemes in terms of CV templates, the use of C slots 
without associated segments or without association to the syllable node, and the 
description of compensatory lengthening as the spreading of segments to vacated 
slots. These arguments are discussed in section 10.4. Before considering them, we 
need to say a few things about syllabification. 


10.3 SYLLABIFICATION: THE MAXIMUM ONSET PRINCIPLE 


The syllabic affiliation of the C slots is largely predictable. The presence of the inter- 
vocalic consonant in the onset of the second syllable, rather than the coda of the first 
(cf. (2a,b)) is universal. That is, consonants prefer to form an onset rather than a coda, 
if they can legitimately do so. This is why Clements and Keyser assume that at best 
only the association of the V slot with a vowel is lexically given, but that the associ- 
ations of the C slots with the syllable nodes can be derived by rule. The principle 
responsible for maximizing the onset is (4) (Kahn 1976). 


(4) | Maximum ONSET Principe (MOP): First make the onset as long as it legitimately can 
be; then form a legitimate coda. 


The MOP requires that a string like [tata] should be syllabified [ta.ta], rather than 
*[tat.a]. And if the language allows [st] onsets, then a string like [asta] will be [a.sta], 
rather than *[as.ta] or *[ast.a]. Of course, it is essential to keep in mind that, while the 
MOP is a universal principle, languages differ in the kinds of syllables they allow. For 
example, Dutch and English allow [st] in the onset, but Spanish does not. Application 
of (4) to a Spanish string [basta] will therefore yield [bas.ta], not *[ba.sta]. By the 
same token, German syllabifies extra as [eks.tra], but Dutch as [ek.stra], because 
Dutch does, but German does not have [st] onsets, even though it does have [Jt] 
onsets. And since no geminates can appear in the onset, the geminate is divided over 
the coda of one and the onset of the next syllable (cf. (2c,d) above). The pronunciation 
of a glottal stop before a vowel in languages that don't have a glottal stop in the lexical 
inventory, as in English How awfull, is a sure sign that the vowel has no onset, at least 
not underlyingly. 

Languages differ in the syllabification domain. In West Germanic languages, the 
syllabification domain tends to be the word, or rather, the phonological constituent to 
be introduced in Chapter 15 as the phonological word. For instance, the English see 
Morton is nonhomophonous with seem Orton, the first syllable being [si:] in the first 
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phrase and [si:m] in the second, which difference will have an effect on the duration 
of [i:], in particular. Other languages may have larger domains, like French, which syl- 
labifies across words and where petit ami ‘little friend’ and petit tamis ‘little sieve’ are 
both [po.ti.ta.mi], or smaller domains, like Japanese, where certain suffixes will not 
syllabify with the stem, as shown by /a.ni/ ‘brother’ and /an.i/ ‘easy going’. In the 
Japanese case, the corresponding phonetic difference is very clear, because the onset 
nasal is coronal, but the coda nasal more likely to be velar or uvular. That is, /a.ni/ is 
pronounced [ani], and /an.i/ [an?i]. More generally, prefixes in German do not syllab- 
ify with the stem, as in Verein [fer?ain] ‘club’, where Ver- is a prefix (section 5.3). 


10.3.1 The sonority profile 


The segmental composition of onsets and codas show striking similarities across dif- 
ferent languages. If you were to bet on the type of consonant that could occupy the sec- 
ond position in a CC onset in some unknown language, your chances of winning 
would be well served by opting for a glide like [j] or |[w].And if your bet concerned the 
first consonant, you would be well-advised to go for [p], [t] or [k]. Venneman (1972) 
described these tendencies in terms of a number of syllable ‘laws’. Much of the regu- 
larity is captured by the Sonorrty PRoFILE, given in (5). 


(5)  Sonority Prorite: The sonority of a syllable increases from the beginning of the 
syllable onwards, and decreases from the beginning of the peak onwards. 


Intuitively, sonority is related to the overall acoustic energy of segments. In (6), the 
classes of segments that are usually distinguished along this dimension are listed in 
the order of increasing sonority. 


(6) ~ soNORITY scare: Obstruents - Nasals - Liquids ([l,r], etc.) - Glides ((w,j], 
etc.) — Vowels 


Thus, the observation is that any onset that reverses the direction of increasing sonor- 
ity, like [mk-] or [wl-], is less common than one that does not, like [pn-] or [ml-]. 
Conversely, any rhyme that increases the sonority from left to right, like [-lj], is dis- 
favoured. Violations of the sonority profile are indeed rare. Swedish has syllables like 
[bærj] ‘mountain’, while Dutch has syllables like [urn] ‘wring’, where the first conso- 
nant is a labial glide. In addition, Clements (1990) observes that in the onset large 
sonority differences are preferred over small ones, making [pj-] a better onset than 
[lj-], while in the rhyme small sonority differences are preferred over large ones, mak- 
ing [-j] a better coda than [-t]. Apparently, syllables prefer to start with a bang and end 
with a whimper. Because the beginning of the syllable is maximally salient as a result, 
catching the listener’s attention when it should, the reason for this state of affairs is 
probably perceptual. 

The preference of onsets with increasing sonority will lead to syllabifications of 
intervocalic sequences of consonants as in [akla] as [a.kla]. That is, in a language that 


allows CC onsets, [alka] will typically come out as [al.ka], and not as *[a.lka] but [akla] 
as [a.kla]. 
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There is an interesting consequence of the observation by Clements (1990) that syl- 
lables, to repeat the phrase, start with a bang and end with a whimper. Syllables tend 
to group in words so that the sonority of the end of one syllable is greater than that of 
the beginning of the next, favouring a whimper-bang transition over a bang-whimper 
one. In languages without complex onsets or codas, [al.ka] will therefore be a more 
likely structure than [ak.la]. This tendency was earlier described as the SYLLABLE 
Contact Law by Venneman (1972). So also when considering the relation between syl- 
lables, the tendency is to maximize the bang at the beginning. 

Sonority can be defined in terms of the features introduced in Chapter 5 (Clements 
1990). The four consonantal sonority classes of (6) are so characterized by the features 
[+cons], [son], [approx]. To distinguish vowels from glides, the syllabicity needs 
to be called upon. 


Q80 What do the English words stigma, comrade and Daphne have in common? 


10.4 ARGUMENTS FOR THE CV TIER 


In this section we consider three types of argument that have been made for the inclu- 
sion of segmental timing slots in phonological representation. Later in this chapter, we 
will see that the role of these timing slots is in part better served by a more econom- 
ical representation, that of the mora. 


10.4.1 The templatic use of the CV tier 


The independent status of the CV tier was first proposed in McCarthy (1985a). It was 
inspired by the fact that certain morphemes in Arabic appear to be specified in terms 
of strings of consonant and vowel positions, or skeletal slots, referred to as templates. 
The morphology of the Arabic verb includes a number of derivational morphemes 
(‘conjugations’), which are not just expressed by particular affixes, but also by differ- 
ent templates. In some cases there is no affix, so that conjugations can differ only in 
the CV template they have. The verbal root consists only of consonants, usually three. 
The vowels in such verbal forms represent the third type of morpheme, which corres- 
ponds to verbal aspect or voice. In (7), some of these morphemes are listed. A verbal 
form, then, consists minimally of a verbal root (two or more consonants), a conjuga- 
tion (a CV template) and a verbal aspect (one or more vowels). 


(7) ktb ‘write’ 
hq ‘be true 
CVCVC Plain 
CVCCVC ‘Intensive’ 
CVVCVC ‘Influencing’ 
a ‘Active Perfective’ 


ui ‘Passive Perfective 
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The corresponding forms for ‘write and ‘be true are given in (8), where the final [a] 
is the third person masculine singular. To derive these forms, it is necessary to assume 
that the consonants and vowels form separate tiers, and that the consonants associate 
with the Cs and the vowels with the Vs. Moreover, adjacent slots of the same type are filled 
by the same segment. That is, association is not one-to-one, but subject to the require- 
ment that long vowels and geminate consonants are created where possible. This is 
shown in (9), where the third person suffix is not shown.! 


(8) Conjugation Active Passive 
Plain katab-a ‘he wrote’ kutib-a ‘it was written’ 
Intensive kattab-a ‘he caused to kuttib-a ‘it was made to 
write’ write 
Influencing kaatab-a ‘he corresponded kuutib-a ‘he was corres- 
ponded with 
Plain ħaqq - a ‘it is true’ 一 
Intensive haqqaq-a ‘he realized st? fuqqiq-a ‘it has been 
realized’ 
Influencing ħaaqaq-a ‘he contested huugiq-a ‘his right was 
sb’s right contested’ 
(9) Kt } k t D 
[HEYRAN] | rod 
CVCCVC CVV OVC 
SS" yw | 
a ú i 


‘he caused to write’ ‘he was corresponded with’ 


For the morpheme ‘be true’ we could either assume an underlying ‘triliteral’ root 
([ħqq]) or a ‘biliteral’ one ([ħq]). If we assume that the direction of association is left 
to right, the form [haqqaq-a], for instance, can be accounted for by the spreading of the 
[q] of a biliteral root, as shown in (10): the left-over C slots are filled by the spreading 
of the last consonant. There is strong evidence that this latter solution is correct, In 
verbal and nominal forms with three surface consonants, it is never the case that the 
first two consonants are identical, a rare exception being [dadan] ‘plaything 
(McCarthy 1985a: 146). This is explained if repetitions of the same consonant are pro- 
duced by the left-to-right spreading of a rightmost consonant segment to empty C 
slots, as shown in (10). Thus, a constraint on the underlying form of Arabic roots is 
that there are no adjacent identical consonants. This constraint is evidently akin to the 
ban on sequences of identical tones discussed in the previous chapter, the OCP. This 
constraint, therefore, has come to be generalized to a ban on adjacency of identical 
phonological elements in general. 


! The expected form *[haqaq-a] ‘it is true’ is ruled out because of a general process deleting vowels in this 
context. 
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(10) h q h q 
CV CC VG GCVVCV EG 
Yo 
a iy 
‘he realized s.t. ‘his right was contested’ 


Q81 A secret language based on Amharic uses the forms in the second column for the 
words in the first (McCarthy 1985). The apostrophe indicates glottalization of the 
consonant; adjacent glottalized consonants are only marked once for glottalization. 


Amharic Disguised form Gloss 

a. gin gainan ‘but’ 
matt’a mait’at’ ‘come’ 
krfu kaifaf ‘cruel’ 
t’att’a t’ait’at’ ‘drink’ 
hed haidad ‘go’ 
waddada waidad ‘love’ 

b. wark’ wairk’ak’ ‘gold’ 
tamara taimrer ‘learn’ 
sigara saigrar ‘cigarette’ 
sakkara saikrar ‘drunkard’ 
kabad kaibdad ‘difficult’ 
wyfat waiftat ‘lie’ 


What is the difference between the Amharic words in (a) and those in (b)? 

Which segments are preserved in the speech disguise forms and which are not? 

What determines the number of (surface) consonants in the speech disguise forms? 
Give an account of the formation of the speech disguise forms, employing a 
representation with a skeletal CV tier. Consider [ai] a single short vowel that associates 
with a single V slot. 

How does your description of this secret language differ from Arabic verbal morphology? 
Give the representations of the disguise forms for ‘drink’ and ‘gold’. 


Se Soc 


Q82 Scottish Gaelic has a three-way length opposition for vowels, ‘short’, ‘half-long’ and 
‘fully long’, as illustrated by [fin] ‘we’, [fi:ni] ‘venison’, [fi::nj] ‘to sing’, or [tut] ‘to go’, 
[u:t] ‘apple’, [su: ‘eye’. At most two different vowel qualities may occur per syllable, 
as illustrated in [ian] ‘John’ and fia:n] ‘bird’. in addition to this constraint, there are 
these two facts: 


a. Diphthongs are either half-long or fully long. 

b. The distribution of the durations of two intrasyllabic vowel segments is ‘short, short’ (as in 
[ian]) or ‘short, half-long’ (as in [ia:n}), but never ‘half-long, short’ (*“[i:an]) (Ternes 1973: 96). 

How would you account for these facts? 
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10.4.2 Unfilled and unassociated slots 


The slots in the CV skeleton are structural positions, which in the usual case dominate 
segmental material and are associated with a syllable node. By assuming that slots 
need not have an association with any segments and need not have an association 
with a syllable, their explanatory role can be extended. Clements and Keyser (1983) call 
on both these possibilities to account for the alternation between French morpheme- 
final consonants with Ø, known as Liaison, and the phenomenon of h-aspiré. 

The French definite article is [la] (masc.) or [la] (fem.) for the singular, and [le] for 
the plural, as shown in (11a). When the noun begins with a vowel, the vowel of the sin- 
gular definite article is deleted, while the plural is followed by an apparently inserted 
[z], as can be seen in (11b). In fact, many words appear to be have a consonant that 
only shows up before a vowel, like [pati], which has a [t] in [patit ami] ‘little friend’. 
These potential consonants are called Liaison consonants. 


(11) Singular Plural 
a. labwa le bwa ‘wood’ 
lə pa le pa ‘step’ 
la karaf le karaf ‘carafe 
la nyi le nyi ‘night 
b. labe lez abe ‘priest’ 
lide lez ide ‘idea’ 


The loss of the vowel of the singular definite article is effected by euision (12), which deletes 
the final V slot of the definite article before V, causing its syllable node to be lost as well. 


V 
(12) i BES ON | —- &__V 
[...]DEF 


LIAISON is expressed in (13), which says that an onsetless syllable syllabifies any unsyl- 
labified consonant appearing before that vowel. 


(13) LIAISON o 
2 
CY 


These rules produce the correct results if the singular and plural forms of the article 
are represented as in (14a,b). The singular will lose its vowel before a V-initial word, 
while the unsyllabified final consonant in the plural will only be syllabified, and pro- 
nounced, before a V-initial word. 


t4- a 0 b 6 
A A 
CY CING 
| | [isthe 
l a lez 
‘DEF-SG-FEM? ‘DEF-PL’ 
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Certain vowel-initial nouns appear to behave as if they began with a consonant: they 
take the preconsonantal alternants of the definite article, both in the singular and in 
the plural, as shown in (15). In order to account for the behaviour of words like ‘hero’ 
and ‘hatred’, their underlying forms are assumed to begin with an empty C, which is 
syllabified as an onset C just as any filled C would be. This is shown in (16b), which 
representation should be compared with (16a). 


(15) Singular Plural 
la ero le ero ‘hero’ 
laen le en ‘hatred’ 
(16) "ase ae be or be 
MRA 1 A 
VQ Y CNG 
ee | || 
abe e ro 


Thus the deletion of pre-V [a] in (17a), as well as its retention in the pre-C context of 
(17b), are readily accounted for. Likewise, the difference between the presence of the - 
liaison [z] in ‘the priests’ (18a) and its absence in ‘the heroes’ (18b) is explained by the 
inability of the unsyllabified [Z] to undergo Liaison in (18b). This solution accurately 
expresses the fact that words with h-aspiré behave as if they began with a consonant, 
even though no surface consonant is observed. Also, it correctly characterizes both the 
distribution and the identity of the Liaison consonant. 


(WZ), co? ire o o 
\ A A AAN 
GGN E i N Del 7 AN G 
rae | | | | 
lide kä €n 

(18) a o oe wo b. o o 6 
IN. ANS OFS A NA 
CV CV CVE CU Wi VE 
are i Si oe E 
lezabez Lez Po 2 


10.4.3 Compensatory lengthening 


A final type of argument for the existence of the skeletal tier is the phenomenon of 
compensatory lengthening. Frequently, the loss of a segment is incomplete in the 
sense that the time it took before it was deleted is preserved in a neighbouring 
segment. The parent language of English and Frisian, which at one time constituted a 
West Germanic dialect sometimes referred to as Ingwaeonic, underwent a process of 
nasal loss before fricatives within the word. Prior to the loss of the nasal, the vowel 
before it was short, but a long vowel remains today. In (19), we give reconstructed 
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(hypothetical) forms. The presence of the nasal is attested in Gothic texts, as well as in 
modern German. (Probably, the nasalization was also preserved on the vowel at first, 
and was lost later. We ignore this in the transcriptions.) 


(19) gans gais ‘goose’ 
fimf fif five 
tanb ta:0 ‘tooth’ 
munb mu:0 ‘mouth 


Again, if segmental duration is contained inside a single element with the rest of the 
information about the pronunciation of the segment, we can only describe the change 
with the help of the transformational format. The representation with the skeletal tier 
allows one to express the change as a retiming of the segments, as shown in (20). An 
additional measure, one which is automatically triggered by the [—cons] segment 
associated with it, is a change of the vacated C slot into a V slot. 


we PE A 
GV Gy Guo, OC V WG 
Lt dad L Wong 
mun @ m i p 


Compensatory lengthening can also be recognized in consonants that have come to 
occupy the syllable peak (syllabic consonants). The second syllable in English bottle 
['bət]] consists purely of the consonant [I]: the release of the plosive [t] takes place 
through a lowering of the sides of the tongue, the tip retaining its contact (lateral plo- 
sion). Assuming that the duration of the syllabic [I] is no different from the duration 
of [al], a less common but attested pronunciation, the syllabic consonant could be rep- 
resented as in (21a). In Mandarin Chinese, retroflex syllabic consonants derive histor- 
ically from the spreading of the onset consonant to the syllable peak, as in [ts%.] ‘late’ 
or [2] ‘wet’. In (21b) the voicing of the consonant will follow from a constraint that 
syllable peaks are voiced (Wiese 1986). 


Retiming of segments is a well-known feature of Bantu languages. Class prefixes like 
[ki] usually only show up in that form before consonant-initial stems. When the stem 
begins with a vowel, however, the close vowel turns into a glide while the vowel length- 
ens. Consider the Luganda examples in (22), from Clements (1986). 


(22) li + ato lja:to ‘boat 
mu + iko mwi:ko ‘trowel’ 
ki + uma kju:ma ‘metal object 


mi + aka mjatka ‘year’ 
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If it is assumed that the resulting onset does not consist of two consonants, but of a 
single, complex consonant which is either labialized (in the case of retimed [u]) or 
palatalized (in the case of retimed [i]), the process can be described as in (23). 


(23) © oo o C 


Q83 in Catalan, a word-initial high vowel turns into a glide when preceded by a stressed 


vowel, as in [kunta'rajs'torias] contará histories ‘he will tell stories’ and 
[keta'lawniBar'sal] catalá universal ‘universal Catalan’ (Hualde 1992). Assuming that 
the duration of the vowel plus glide is equivalent to that of a short vowel, how could 
you describe this process in CV phonology? 


10.5 MORAS 


It has been pointed out that the skeletal tier fails to account for two phenomena (Hayes 
1989). One is that compensatory lengthening always occurs in the case of segments 
deleted from the rhyme, never in the case of segments deleted from the onset. This 
suggests that segments in the rhyme possess something that other segments do not 
possess. Second, it appears that many languages distinguish syllables on the basis of 
quantity, a property of syllables which is determined by the number of segments in 
the rhyme, again to the exclusion of their onset. As for the first objection, consider the 
fate of the onsets [kn-, gn-] in Middle English. When [k,g] were lost, and the English 
syllable no longer admitted onsets that consisted of [—cont] [+nas], there was no 
compensatory lengthening of any of the other segments in words like knot and gnat, 
now pronounced [not] and [net], not *[n:ot] or *[no:t], etc. 

Second, the location of the word stress frequently appears to be sensitive to the seg- 
mental composition of the rhyme, while the number of segments in the onset is irrele- 
vant (barring a few exceptional and incidental instances). In Hawaiian, for instance, 
stress falls on the last syllable if it contains a long vowel, and on the penultimate sylla- 
ble if the last is a short vowel, as illustrated by [na'na:] ‘strut’ and ['na:na] ‘for him’. 
Significantly, a consonant in the rhyme is often counted as if it was a vowel. In Hopi, for 
instance, the stress falls on the first syllable if the rhyme contains a long vowel or a short 
vowel and a consonant, as shown in (24a), but on the second syllable if it contains a 
short vowel, as shown in (24b) (Jeanne 1982). That is, the consonant in the rhyme can 
determine the location of the stress in a way that a consonant in the onset cannot. 


(24) a. 'qoq.te.som.pi ‘headbands’ 
'so:.ja ‘planting stick 
b. qø.'tø.som.pi ‘headband’ 


ko.'jo.po ‘turkey’ 
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In order to capture the difference in status between segments in the rhyme and seg- 
ments in the onset, the mora has been proposed as an intermediate level of structure 
between segments and the syllable (Hyman 1985, Hayes 1989). The prosodically 
‘active’ status of a segment can then be expressed by giving it moraic status, while 
‘inactive’ segments are not entitled to their own mora. A language that has no vowel- 
length distinction and does not allow a coda will have only syllables with one mora. 
A language with a vowel-length distinction has both monomoraic and bimoraic syl- 
lables. The term weight is used to refer to the opposition between monomoraic (or 
light) and bimoraic (or heavy) syllables. Onset consonants are nonmoraic, and in the 
representation of Hayes (1989) are attached directly to the syllable node. Postvocalic 
consonants may or may not be moraic. They are moraic in Hopi, where the stress rule 
treats the coda consonant on the same basis as the second half of a long vowel. 
Because vowels are always moraic, but coda consonants may or may not be, Hayes 
assumes that languages may or may not apply the rule of wEIGHT-BY-POSITION, by which 
a postvocalic consonant is assigned a mora. In (25), some syllable types are repre- 
sented. As will be clear, (25a,c) are light syllables, while (25b,d) are heavy syllables.’ 


(25) a o b. o ef o d. CT 
T | Ah 和 Bag vt art 
short V longV  nonmoraiccoda += WEIGHT-BY-POSITION 


Geminate consonants must be moraic in order to express their duration. The Italian 
distinction between [fato] ‘fate’ and [fat:o] ‘fact’ is expressed by representing the 
nongeminate as an onset consonant, and the geminate as a consonant that is attached 
to a mora in the first syllable and as a weightless onset in the second, as shown in (26). 
Languages with intervocalic geminate consonants cannot be monomoraic. 


o œ b. o o 
Ar MM 
oe Dy 
Although it is not uncommon for languages to allow the last syllable of the word to have 


three moras, languages that generally allow their rhymes to have three moras are rare. 
That is, if a language has wEIGHT-BY-POSITION,Usually only the first consonant after a short 


(26) a. 


>The mora tier has usually been argued for as an alternative to the CV tier (or the X tier, in variants that make 
no distinction between C slots and V slots), not as an additional tier. Lahiri and Koreman (1988) observe that 
syllable weight can be independent of vowel duration, and suggest that both types of representation may be 
present in at least some languages. 
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vowel is moraic, while the next consonant or any consonant after a long vowel is non- 
moraic. Languages that have three degrees of vowel length, like Dinka, and languages 
that allow a geminate to follow a long vowel have trimoraic syllables. Such syllables are 
called superheavy. Tamil, as we saw in (2), is an example of a language with geminates 
after long vowels. The moraic representations of those four structures is given in (27), 
where (27a) has a light, (27b,c) a heavy and (27d) a superheavy first syllable. 


o d. o 
| 


(27) a. o o b. o © C o o 
[* js 人 人 [+ u /A H 
| / | Wa | VV | 
part <u peta. a pa to pea U 


Since trimoraic languages are rare, it is common for long vowels to be banned 
before geminates. For instance, in Koya, while words like [a:nd] ‘female power’, 
[manasu:rku] ‘men’, [me:ndu:li] ‘back’, in which long vowels appear before a coda con- 
sonant, are fine, as are short vowels before a geminate, as in [pik:a] ‘cup’, a form like 
* [pu:t:i] is ill-formed. In fact, when such a combination arises as a result of morpho- 
logical concatenation, the vowel is shortened, as shown in (28). 


(28) ke:t: onda [ket:o:nda] ‘he told’ 
o: - t:-omdu [otro:ndu] ‘he brought 
The restriction is explained if the Koya rhyme is restricted to two moras, where the 
second can either be the second half of a long vowel or a consonant, additional conson- 


ants being nonmoraic. Since geminates cannot be nonmoraic, they require the vowel 
before them to be short (Tyler 1969). 


Q84 Word stress in Koya falls on every long syllable and every closed syllable in the word, 
and the language therefore has wEIGHT-BY-POSITION (Tyler 1969). Koya does not have 
complex onsets. Give the moraic-cum-syllabic representations of the words [pik:a], 
[manasu:rku] and [beske] ‘when’. 


The syllable structure of Tera is CV(X), where X can be V or C. Except at the end of the 
sentence, word-final [a] is deleted unless the preceding syllable has a long vowel or 
when it is immediately preceded by two consonants. Thus the words in the first 


column are subject to the rule, but not those in the third (Newman 1970). ([mb] and 
[nd] are single consonants.) 


ə deleted a retained 

sədə ‘snake’ pərsə ‘horse’ 
wura ‘tomorrow’ me:nə ‘today’ 
mbuka ‘to throw at’ xennda ‘nose’ 


1. What determines if [a] is deleted? 
2. Does Tera have WEIGHT-BY-POSITION? Motivate your answer. 
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Q86 The following words show the position of the main stress in a number of words in 
Lenakel (Hayes 1995). 


napuk ‘song’ 

ttkomkom ‘branches’ 
natjakamwap'ke:n ‘you two will be jealous’ 
reme'tja:w ‘he arrived’ 
tup”alu’kaluk ‘lungs’ 


Does Lenakel have weicHT-By-PosiTion? Motivate your answer. 


Q87 Here are some examples of a word game in Bakwiri (from Hombert 1986). (Bakwiri is 
a tone language and tones are marked in the usual way; it has prenasalized stops: 
[mb], [nd] and {[ņg].) 

Bakwiri Word game 

mòkò kòmò ‘plantain’ 

lówá wáló ‘excrement’ 

kéndi ndiko ‘rice’ 

ind nmi ‘throat’ 

éz6é Z6?ee ‘it is not’ 

lùùngá ngààlú ‘stomach’ 

Ñòßá Bààljó ‘door’ 


a. 
b. 
c. 
d. 
e. 
f. 

g. 
le 


What is the general rule for forming word game words? Apply your rule to the 
hypothetical form [ndak6d]. 

Bakwiri syllable structure is (C)V(V), where VV may be filled by one vowel, or by two 
different vowels. Give relevant examples. 

Why isn’t the game form for ‘door’ *[Baalio]? 

Why do you think there are glottal stops in the words for ‘it is not’ and ‘throat’? 


Q88 Kisanga, another tone language, has the syllable structure (C)V(V). In a children’s 
language game called Kinshingelo, sentences (a) and (b) are pronounced as (c) and 
(d), respectively (Coupez 1969). 


a. Obé múk”è:tù  t”âjá: kú múkólá 

‘you, my friend, come with me to the river’ 

bâ:nábákàjí bâ: mú kóngò bă:tèm"â:  kúdímá 

‘the Congolese women love to cultivate the land’ 

béó mitick”é = iat”: kú mudlik6 

bâ:nábájiká bâ: mú ngókò bimwate: komadf 
Argue that this language game provides evidence for the existence in Kisanga of: the word: 
the syllable; the tonal tier; a timing tier. 
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restrictions. 
Base 


Q89 Tagalog has a denominal formation meaning ‘an imitation of’ (Schachter and Otanes 
1972). The formation involves the reduplication of the noun and the suffixation of [an], 
as shown below. Syllable onsets in Tagalog are C or CC, where — at least in the native 
vocabulary — the C in second position is either [j] or [w]. The syllable may have a coda, 
which is obligatory in word-final position. In addition to many other consonants, [h] 
and [7] can occur at the end of the word, but these consonants do not appear word- 
internally in the coda. Vowel length is in general unpredictable, except in word-final 
position, where vowels are always long. This can be seen in the forms in both 
columns. However, vowel length in ‘imitation of forms is subject to further 


Derived form 


a. bathayj ‘house’ bahajbahaija:n ‘doll house’ 

b. libru:h ‘book’ libru:libru:ha:n ‘imitation book’ 
c. pa:ri:? ‘priest’ parizpariz?ain ‘pretended priest’ 
d. bulakla:k ‘flower’ bulaklakbulakla:kazn ‘artificial flower’ 
e. kabazju:h ‘horse’ kabaju:kabaju:zha:n ‘imitation horse’ 
f.  dazho:n ‘leaf’ dahondaho:na:n ‘imitation leaf’ 


There are two generalizations to be made about vowel duration in ‘imitation of 
formations. The first concerns syllables that are not stem-final. What is it? 
2. The second generalization concerns stem-final (but not word-final) syllables. What 
is it? 
3. Is the second generalization more easily expressed in terms of a moraic representation or 
a CV representation? Motivate your answer. 


10.6 SYLLABLE-BASED GENERALIZATIONS 


In this section we consider the role of the syllable in the SD of phonological general- 
izations. It is very common for consonants to be pronounced differently depending on 
their position in the syllable. Thus, as was explained in Chapter 4, British English [1] is 
accompanied by velarization whenever it appears in the coda, as shown in (29). 


Observe that [lj-] is a legitimate onset, occurring word-initially in a word like lure 
[ljuə], for instance. 


(29) In onset In rhyme 
[lar] lie [ha:4] hall 
[luk] look [kitt] kilt 
['væli] valley ['mitdju:] mildew 


[r'tæljən] Italian ['morat] moral 
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The need to include syllable structure in phonological representations was demon- 
strated most clearly by Kahn (1976), who showed that in many cases the context of 
a rule could only be expressed in segmental terms at the cost of fairly baroque speci- 
fications. In the case of l-veLarizaTion, we would have to state that it applied ‘before 
all consonants except [j] and at the word end’. Obviously, this approach does not 
yield a natural class. It would not be possible either to derive the nonvelarized 
allophone from an underlying velarized [t], as in that case we would have to state 
the context as ‘before all vowels and [j]’. Again, this is not a natural class, since [w] 
is excluded. Reference to the syllable makes it possible to formulate the context as 
in (30). 


(30) I-vetarization! 一 t/,(...V C...), 


Many other examples can be given of consonants having different allophones depend- 
ing on whether they appear in the onset or the rhyme. Thus, the Dutch labial approxi- 
mant is labiodental [v] in the onset and rounded bilabial [B] in the coda in the 
varieties spoken in the northern half of the Dutch language area,’ as shown in (31). 
Again, neither the generalization ‘in the coda’ nor the generalization ‘in the onset’ can 
be expressed directly. Since Dutch allows [ur-] onsets, the onset happens to be equiva- 
lent to ‘before a vowel or [r]’, while the coda must be captured by ‘before a consonant 
other than [r] or a word boundary’. Such SDs clearly look arbitrary, and fail to express 
just what it is that governs the allophony. 


(31) In onset In rhyme 
{le:'vin] ‘lioness’ [le:B] lior 
[vre:t] ‘cruel [xe:Bt] ‘yawn+t+35G 
['kle:van] ‘scimitar’ [‘lyBta] Jul? 


10.7 POST-MOP SYLLABIFICATION RULES 


The Maximum Onset Principle (4) introduced in section 10.3, together with the speci- 
fication of a syllabification domain within which it is applicable, suffices to explain the 
syllable structure of many languages. Other languages have additional syllabification 
rules. The motivation for these additional rules is that consonants which from one 
point of view appear to be in the coda would seem to act as onset consonants of the 
following syllable when viewed from a different perspective. Somewhat crudely, we 


> Dutch is spoken in the north-western half of Belgium and in the Netherlands. Many English-language publi- 
cations use the word ‘Flemish’ when referring to Dutch as spoken in Belgium. 
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might say that there are two possible approaches to this situation. It is possible to redo 
the initial MOP-driven syllabification after a derivation or cliticization, so that what 
appeared as a coda consonant at one level of representation appears as an onset con- 
sonant at another, a description known as resyllabification. If the first syllabification 
was V.CCV, as shown in (32b), the second would be (32c). Other descriptions will go on 
the assumption that the MOP is a persistent principle and that additional syllabifica- 
tion rules must respect existing syllabification. The new syllabification is ‘added’ to the 
old, and as a result a consonant may simultaneously belong to the coda of one syllable 
and to the onset of the next: such consonants are said to be ambisyllabic. As stressed 
by Kahn, the latter position is more restrictive, in that it limits the number of possible 
syllabifications. In the hypothetical situation (32), the syllabification required by the 
MOP would be VCCV and only the first C can be ambisyllabic (32a), since the 
No Crossinc Constraint rules out the ambisyllabicity of a second C. By contrast, 
in a resyllabification analysis, there is no principled limit to the number of consonants 
that can be resyllabified. Starting out with (32b), we could change that into (32c), but 
there is no reason why a further resyllabification could not change that to VCC.V 
(not shown). In this section, we will argue that ambisyllabicity is in fact a necessary 
representation on the basis of Kahn's description of (American) English FLAPPING. In 
Chapter 15 we will see why some phonologists nevertheless object to ambisyllabic 
representations. There we will also point out that those objections should not be taken 
too seriously. 


(32) a o CT 
LS 


人 WV 


10.7.1 Ambisyllabicity in English 


English has two post-MOP syllabification rules. taison, which applies across word 
boundaries, causes a word-final consonant to be in the onset of a following vowel- 
initial word. This is shown in (33). The rule only applies across word boundaries, 
because word-internally onsets will have been created by the MOP. As a result of 
(33), a consonant like [v] in five is both syllable-final and syllable-initial. After 
the application of Liaison, the word-initial vowel can no longer be preceded by a 
glottal stop. 


(33) LIAISON © o 
C V 
(34) farv egz [v] five eggs 


ould inglif [d] Old English 
sauð æfrəkə [9] South Africa 
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The other rule is RIGHT capture, which causes the first consonant of a weak syllable to 
serve as a (final) coda consonant of a preceding syllable.’ That is, it is a syllabification 
rule which applies within the domain of the foot. As explained in Chapter 13, the 
English foot is left-dominant, in which the weak syllable(s) typically have [3,1,0] or, 
especially word-finally, [i,ou]. Foot-based RIGHT capture, given in (35), spreads an 
onset consonant to a preceding syllable, where it forms part of its coda. 


(35) RIGHTCAPTURE (...0 © ...)F 


‘| 


C 


As a result of (35), the [p] in a word like happy will be the final consonant of the first syl- 
lable and the first consonant of the second syllable. As in the case of MOP, we must make 
the assumption that the resultant coda is a well-formed coda of English. That is, RIGHT 
CAPTURE cannot create a syllable *[bedf] in ['bedfard] Bedford, even though this word is 
a single foot. Examples of words in which consonants become ambisyllabic by (35) are 
given in (36). In (36a), the rule applies after an open syllable, in (36b,c) it applies after a 
closed syllable, while (36d) shows that the left-hand syllable may be unstressed. 


(36) a. ‘siti [t] city 
b. 'ka:nsərt [s] concert 
c. 'kantri [t] country 
d. a'speragas [r], [g] asparagus 


As pointed out by Kahn (1976), American English FLAPPING,the rule that causes certain 
instances of intervocalic [t,d] to be pronounced as an alveolar flap, provides evidence 
that ambisyllabic consonants are a natural class. FLAPPING applies in what are at first 
sight two entirely unrelated contexts. In (37), the contexts in which the rule applies are 
compared with a context in which it does not. The curious generalization that emerges 
is that FLAPPING applies when the right-hand vowel is in the next word, or is in a weak 
syllable in the same word. 


(37) FLAPPING applies NO FLAPPING FLAPPING applies 
['lertər] later ['lerteks] latex ['lert eks] late ex 
[hrtm] hitting [‘hitart] Hittite [hit ‘aik] hit Ike 
['siti] city ['sætarər] satire [‘nait aul] night owl 


The unifying element of the contexts in which FLAPPING applies is ambisyllabicity. 
LIAISON and RIGHT CAPTURE, Which can both be motivated on independent grounds 
(cf.also Gussenhoven 1986, Rubach 1996), provide the appropriate representations for 
rule (38), which simply says ‘Flap [t,d] when they are ambisyllabic’. 


* This rule was originally seen as one that applied only if the left-hand syllable was open (Kahn 1976, Clements 
and Keyser 1983). The more general formulation in (35) was proposed and motivated in Gussenhoven (1986). 
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q 
9 


(38) FrapPixe [td] —> [r]/ 


| -o< 


[一 cons] 


Neither the linear theory of SPE nor the resyllabification theory yields an elegant 
account of these facts. In a linear theory, we would arrive at (38), after Kahn (1976), 
where the angled brackets indicate mutually dependent terms. 


村 +syl 
(39) riareinc [td] 一 [r]/[—cons] __ <##>, a —stress =~ 


Condition: if not a, then b 


In addition to the awkward condition ‘within the word, the following vowel must be 
unstressed’, we also need to stipulate that in slow, deliberate speech styles, in which 
rules like Liaison do not apply, word-final [t,d] will not be flapped. This will make it nec- 
essary to add a second condition: ‘In unconnected speech, not a’. Clearly, the ambisyl- 
labic representation is superior to a linear analysis without syllable structure. It is also 
superior to a resyllabification theory, such as offered by Selkirk (1982). Selkirk assumes 
that FLAPPING applies to coda consonants. In order for this account to go through, there 
must be no LIAISON (in order to keep final [t,d] in coda position), while RIGHT CAPTURE 
must be replaced with a rule that delinks the initial onset consonant of a weak syllable 
and moves it into the coda of a preceding syllable, so that get $ Anne, get $ it and gett$ing 
are syllabified as indicated by $. The disadvantages of this analysis are, first, that real 
coda consonants, like [d] in Too bad! or [t] in The lot, are characterized as flapped, a 
claim which requires independent evidence. Second, an explanation is required for the 
fact that [t,d] in ‘real flapping contexts are neutralized (cf. bidder ~ bitter), but contrast 
in the ‘real’ coda (bid - bit). Third, the claim that weak syllables in words like city or 
sadder are onsetless is difficult to square with the general finding that onsetless sylla- 
bles may be pronounced with a preceding [?]. Moreover, as was observed above, there 
is the general drawback that the resyllabification option will have to be constrained, 
since it would otherwise freely allow changes of onsets into codas and vice versa. 

In moraic structure, ambisyllabic consonants have an association with the last mora 
of the syllable it closes and one with the onset of the next, as shown in (40a) for Old 
English, It is an interesting fact that languages do not appear to contrast ambisyllabic 
and geminate consonants, which suggests that they be represented identically within 
words (van der Hulst 1985). For English, this assumption leads to (40b) as the represen- 
tation of city. For English, we are assuming that unstressed syllables have only one mora. 


(0) a g ø o b o 
es 
BRE) pe |p uu p 
ALTI TA | | 
ould Igy gli f Sit ot 
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The structure of the syllable rhyme of Brazilian Portuguese (BP) is V(X), where X 
can be C or V. BP has the nasals [m,n,nl. The [p] distinguishes itself from the other 

nasals in a number of respects. First, while [m,n] can appear word-initially, as shown 
in (1a,b), [p] cannot, as shown in (1c), where V stands for any vowel (Wetzels 1997b). 


(1) a. ‘movel ‘mobile’ 
D nore ‘north’ 
os 


Second, although diphthongs are freely formed out of consecutive vowels before other nasals 
(2a,b), they do not occur before [p]. Instead, the two vowels are divided over two syllables (2c,d). 


(2) a. ‘rei.nu ‘kingdom’ 
b. an. dai.mu ‘stage’ 
c  fu.'ina *“fuina ‘weasel’ 
d. ra'ina *'raipa ‘queen’ 


Third, while [m,n] tolerate a consonant immediately before them (3a,b,c,d), [p] does not (3e). 


(3) a. a'dornu ‘ornament’ 
b. ‘arma ‘weapon’ 
c. ‘alma ‘soul’ 
d. vulne'ravel ‘vulnerable’ 


e.  *VrpV, *VIpV 


The three differences between [p] on the one hand and [m,n] on the other can be explained 
if [n] is incorporated differently in syllable structure from [m,n]. 


1. 


What is this structural difference? 


BP has a rule of vowet NASALIZATION which causes a stressed vowel to be nasalized before a 
nasal, as shown in (4a,b). Unstressed vowels remain oral before nasal consonants, as shown 
in (4c,d). However, before [n], vowels are always nasalized, as shown in (4e,f). 


(4) a. 


‘sinu ‘bell’ 


b. ‘kama ‘bed’ 
c.  bo'neka ‘doll’ 
d.  kuma'ri ‘chili’ 
e. arina. ‘spider’ 
f.  di‘peiru ‘money’ 


It is possible to account for the nasalization of stressed vowels before [m,n] and the lack of 
nasalization of unstressed vowels before [m,n] by assuming that in stressed syllables consonants 
like [m,n] are incorporated differently in syllable structure than in unstressed syllables. 


What is that assumption? 
3. What is — under that assumption — the correct generalization for VOWEL NASALIZATION? 


4. How does your solution account for the fact that before [n] both stressed and unstressed 
vowels are nasalized? 
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Q91 In some dialects of Spanish (Harris 1983), [r] and [I] are realized as [j] in certain 


positions, as illustrated in (1). . 

(1) revolver [revojvej] ‘revolver’ 
carta [kajta] ‘card’ 
papel {papej] ‘paper’ 
calor [kalj] ‘heat’ 


Give an account in SPE theory as well as in a theory that can refer to the syllable, and argue 
that the latter is superior. 


Q92 British English (RP) [r] can be realized either as a flap [r] or an approximant {1} in the 
expressions in columns | and Ill, but in those in column ll only an approximant 
realization is available (Rubach 1996). 


courage courageous for example 
very reduce for instance 
baron red the other end 
laurel bright 


Argue that these data support the existence of ambisyllabic consonants in English. 


10.8 _ CONCLUSION 


In this chapter we have seen that segmental duration can be accounted for by 
assuming that there is an independent tier of skeletal slots which constitutes an inter- 
mediate level of structure between the segments and the syllable. Moreover, by 
differentiating between C slots and V slots, the feature [+ syllabic] can be dispensed 
with. In addition, the elements on this tier serve to define the morphemes for verb 
aspect in Arabic. By allowing C slots to be unsyllabified, it also appeared possible to 
account for French arson, while an empty C slot could be used to account for h-aspiré 
in French. However, a distinction must be made between segments that determine 
the status of the syllable with respect to foot assignment and cause compensatory 
durational effects on the one hand, and those that do not on the other. The mora serves 
to represent this special prosodic status of these segments. The moraic theory predicts 
that compensatory effects and sensitivity to stress rules go hand in hand in any 
one language. It is usually assumed that the mora tier should replace, rather than 
supplement, the CV tier. In this chapter we have also seen that the syllable plays 
an active role in the SDs of phonological rules, which often appear sensitive to whether 


156 Between the segment and the syllable 


a consonant is in the coda or in the onset. Additionally, it has been shown that 
ambisyllabicity is a form of representation required for the descripiton of a number of 
processes. Before moving on to the constituent above the syllable, the foot, in the next 
two chapters we will consider the segment-internal structure, and show how it plays a 
role in the characterization of a wide variety of phonological processes. 


11 


Feature geometry 


11.1 INTRODUCTION 


In the previous two chapters, we have seen that tone and segmental duration must be 
autosegmentalized in order to account for a number of phonological phenomena. In 
this chapter, we will continue this theme. We will replace the matrix comprising the 
remaining distinctive features with a hierarchical configuration of these features. We 
will begin by considering a number of arguments for replacing the unstructured list of 
laryngeal, manner and place features with a feature tree. The main advantage of this 
‘autosegmentalized’ representation is that features or particular groups of features can 
spread to neighbouring segments, which greatly improves the description of assimila- 
tion processes. Moreover, the assumption made in Chapter 5 that not all segments are 
specified for all features appears to have interesting consequences in the new model. 
Some of these will be discussed in this chapter, while others will be discussed in 
Chapter 12, which continues this topic. 


11.2 TWO PROPERTIES OF ASSIMILATIONS 


Assimilations frequently show a non-arbitrary relationship between the structural 
description and the structural change. Time and again, rules appear to transfer some 
feature or features from one segment to a neighbouring segment. However, the repre- 
sentation we introduced in Chapter 6 is incapable of expressing this relationship. To 
illustrate the problem, consider once more the Dutch rule of REGRESSIVE voICING, which 
voices obstruents before [b,d]. Rule (1) might be said to be a natural rule in the sense 
that voiced segments cause preceding segments to be voiced. It would be very strange 
indeed if Dutch had a rule which caused segments to become voiceless before [b,d]. 
The problem with the SPE notation, however, is that it is in fact as easy to write a rule 
that describes such an implausible process (cf. (2)) as it is to write a perfectly ordinary 
rule like (1). That is, the notation represents a theory which cannot distinguish 
between the impossible and the commonplace.! 


! This is not to suggest that dissimilation processes are in general impossible. What is implausible is that a 
dissimilation rule like (2) would ever be needed to describe a phonological process. 
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”一 


—son 


(1) recressive VOICING [一 son] — [十 voice] /__(##) cont 
+voice 


—son 


(2) impaustBLe RULE [+son] 一 [—voice] /__(##) | —cont 
+voice 


What would be an elegant way of expressing the fact that assimilations involve the 
adoption of a feature by one segment from another? First, we might think of a copying 
operation. This would produce a non-arbitrary result in that only features and feature 
values that are present in the context can be copied. However, as we have seen in 
Chapter 9, phonologies show a pervasive dislike of sequences of identical features (the 
OCP), and it would obviously be a bad idea to design a theory of rules whose essence 
it is to violate the OCP. A better solution would be to find a representation which would 
allow segments to share the same feature(s), much in the way successive TBUs can be 
associated with the same tone. 

There is a second striking fact about assimilations. Commonly, the same groups of 
features appear to be transferred from one segment to the next. For example, many 
languages have processes that transfer the features specifying a consonant’s place 
of articulation to a preceding nasal consonant. Since the features in the matrix are 
unstructured, it is impossible to express that the feature groupings we find in the 
processes of so many languages are in fact natural groups of features. That is, in a 
theory without a feature tree, an unnatural feature group like [LABIAL, +son] can be 
as easily expressed as a natural one like [+ voice, +spread, +constr]. 

The solution to both problems is known as feature geometry. The problem of the 
grouping of features is solved by representing segments as trees, in which the nodes 
represent features and feature groups. The assimilation problem is solved by assum- 
ing that a single node may be part of more than one tree. We will first introduce the 
feature tree. Then, in section 11.5, we turn to the ways in which one feature tree 
(i.e. segment) can interact with an adjacent feature tree. 


11.3 NATURAL FEATURE CLASSES 


As was observed above, assimilations of place involve the transfer of a collection of fea- 
tures. Many languages have rules assimilating nasal consonants to the place of articula- 
tion of the following consonant. For instance, Hindi, which has the underlying nasals 
[m,n,n], assimilates the place of articulation of nasal consonants to that of a following 
consonant within the word. Word-internally, sequences like *[nk]; *[nt] or *[mt] are 
therefore ill-formed. As a result of NASAL ASSIMILATION, a prefix like [sam] ‘together’ shows 
alternations of the type illustrated in (3) (Ohala 1983). Clearly, the place features must be 
accessible as a group in order to express this phonological process: the assimilation 
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involves the transfer of all the place features, including dependent ones like [+anterior], 
as shown by the words for ‘equilibrium’ and ‘movement’. 


(3) arka:r ‘shape’ samatkair ‘homophonous’ 
ki:rtan ‘devotional singing saņki:rtan ‘collective devotional singing 
to:l ‘measure’ santo:l ‘equilibrium 
calan ‘conduct sancalan ‘movement’ 
naid ‘sound’ sannaid ‘consonance’ 


There are also processes that transfer a subgroup of the place features. English coroNAL 
ASSIMILATION assimilates [t,d,n,l] in place of articulation to the following coronal conso- 
nant. That is, before [0,6] they are dental, before [t,d,n,l] they remain alveolar, 
and before [1], which in English has a postalveolar place of articulation, they are 
postalveolar. Recall that the feature values for the segments concerned are as given in 
(4). The data in (5) illustrate that the values of the features [anterior] and [distributed] 
are together passed on to the preceding [—cont, +cor] segment (Clements 1985). 


(4) Cor Ant Distr 
0, ð, n + + =f 
t,d,n,l + + = 
i,t + 7 + 

(5) maoum in Rome 
oil dea all there 
in 03:sk in Thirsk 
tual try 
get Jest get rest 
wid0 width 


Of course, single features are also frequently transferred from one segment to the next. 
Consider the Old English data in Chapter 4 (p. 50) which showed that the [一 back] [ç] 
appeared after [一 back] vowels, and [+back] [x] appeared after [十 back] vowels. 
This process is understandable if we assume that the vowel transfers the value for the 
feature [back] (but no other features) to the following dorsal fricative. Just as phono- 
logical processes are seen as evidence for the existence of natural segment classes (any 
group of segments referred to by a process), so a natural feature class can be defined 
as a group of features that is manipulated (i.e. transferred, deleted or inserted) by 
some phonological process. 


11.4 BUILDING A TREE 


So far, the assumption has been that the representation of a segment is an unstruc- 
tured list of features. Spelt out in full, the representation of [t] would be as in (6a). 
What rules like Hindi nasar PLACE ASSIMILATION suggest is that the representation must be 


? This rule should arguably be seen as a rule of phonetic implementation (Hayes 1992). 
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Co 一 


as in (6b). Further, on the basis of processes like that illustrated in (5) we would seem 
to need a representation like that in (6c). 


6) a b. c. 
+cons +cons +cons 
—son —son —son 
—voice —voice —voice 
—spread —spread —spread 
—constr —constr —constr 
—nas —nas —nas 
—lat —lat —lat 
—cont —cont —cont 
COR COR COR 
十 ant 十 ant 十 ant 
一 distr —distr —distr 


What this means, of course, is that the segment is not a single constituent, but has 
internal structure. Such constituency is conventionally represented by means of a tree 
diagram. The development of feature trees (or feature geometry) is mainly due to 
Clements (1985), Sagey (1986) and McCarthy (1988). 


11.4.1 The place node 


The node dominating the place features is called the PLACE node. Within this constituent 
there are four articulator nodes corresponding to the univalent features [LABIAL], [coro- 
NAL], [DorsAL], [RADICAL]. Each of these four nodes dominates subconstituents correspon- 
ding to their relevant features. Thus, the [LABIAL] node dominates [+round] and 
[distr], the [coronaL] node dominates [tant] and [+distr] and the [porsat] node 
dominates [ +back], [high] and [+low]. This representation of the PLACE node is given 
in (7). With the help of this representation, processes can refer to the constituent [LABIAL, 
distr, round], but not to the nonconstituent [high, ant], for instance. We will assume that 
the position of [+ tense] will be under [rap], assuming it is equivalent to [+ ATR]. 


(7) PLACE 


LABIAL COR DORSAL RAD 


distr round ant distr back high low tense 


Q93 


1. Which constituent is transferred from a consonant to a preceding [t,d,n,! ] by English 


CORONAL ASSIMILATION? 
2. Which constituent is transferred from a consonant to a preceding nasal by Hindi NASAL 
ASSIMILATION? f 
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11.4.2 The laryngeal node 


What other natural feature classes are there besides place features? Lass (1976: 145ff.) 
has argued that the laryngeal segments [h,?] have no supralaryngeal specification at 
all. He based his case on the frequently observed phenomenon that obstruents some- 
times appear to lose all their supralaryngeal information. Consider the London 
English data in (8), which are representative of this kind of process. In certain con- 
texts, preglottalized plosives are pronounced as [?]. (The variation between preglot- 
talized plosives and glottal stops is stylistic, i.e. both pronunciations may be heard 
from the same speaker.) 


(8) pr?ktfa or pi?tfa picture 
mui?t wiljam or mii? wiljam meet William 
ni?k ne?ks or ni? ne?ks knick-knacks 
no?t neu or no? neu not now 
kri?p smailin or kii? smailin keep smiling 


In New World Spanish, [s] has been lost in coda position. That is, the plural of [klase] 
‘class’, which in Peninsular Spanish is [klases], is [klase]. (The [e] arises from a sepa- 
rate process which is not relevant here.) However, the [s ] has not left without a trace: 
an [h]-like offglide can be heard after the plural in careful speech. Similarly, voiceless 
sonorants have arisen word-internally after the loss of the oral articulation of [s], as 
shown in (9). 


(9) Stage I Stage II 
mismo miimo or mimmo ‘same’ 
fosforo fooforo ‘match’ 


Both processes could be described if, following Lass, we assume that laryngeal 
and supralaryngeal features are separate constituents. Under that assumption, 
both the English and the Spanish data would receive the same description: deletion 
of the supralaryngeal constituent. In the case of English, this would leave 
[+constr, 一 voice, — spread] behind, i.e. [?], and in the case of Spanish it would leave 
[—constr, —voice, —spread] behind, i.e. a voiceless interval at that point in 
the word. Thus, such orphaned laryngeal features generally end up as [?] and [h], 
respectively. 

To express Lasss proposal in our feature tree, we need to assume that segments 
consist of two main parts: the laryngeal and the supralaryngeal constituent. The dom- 
inating node is referred to as the roor node. If we want to ensure that segments that 
have only one of these constituents can be specified for the major class features 
([+cons] and [+son]), these features must be present in either of these constituents. 
Following McCarthy (1988), we assume they make up the root node. This decision can 
safely be made, since no processes have been reported that transfer [+cons] or 
[+son] from one segment to the next. This means that our tree now looks like (10). 
We will treat the feature [+ approx] as a manner feature. 
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(10) ROOT 
cons 
son 
LAR La 
spread voice constr LABIAL COR DORSAL RAD 


dist round ant dist back high low tense 


11.4.3 The supralaryngeal node 


In the preceding sections, we have been able to establish a constituent structure for the 
major class features, the laryngeal features and the place features. What remains to be 
determined is the location of the manner features [+ nas], [+ cont] and [+lat]. Do 
they form a third constituent, or are they located inside the PLACESUPRALAR constituent? 
We will argue that [continuant] is indeed located in the right-hand branch of (10) on 
the basis of an assimilation process changing [h] to [ç] in Frisian (Tiersma 1985). 
Recall that [h,?] are not specified for [+ cont], and that this was the explanation for 
the fact that it did not trigger NASALIZATION. This rule nasalizes a vowel before [n], pro- 
vided a [+ cont] consonant follows. In (11a) the effect of the rule is illustrated, while 
(11b) shows that [h] does not trigger it (cf. Chapter 5). 


(11) Underlying Surface 
a. in-jqm jain ‘to give in’ 
oan-flrana oafliona ‘to attack’ 
b.  in-helja inhelja ‘to hold in 
oan-hiara oanhiera ‘to listen to” 


The southern variety of Frisian has a rule of DipHTHONG SHIFT, which - among other vocalic 
changes — turns [19] into [je]. When this [j] appears after [h], it assimilates [h] to [ç]. This 
rule can be used to decide which constituent, the laryngeal or the supralaryngeal con- 
stituent, accommodates the feature [continuant]. If this feature is part of the same con- 
stituent as the PLACESUPRALAR node, the assimilation rule will transfer the [+ cont] of [j], 
along with its place features, to [h]. This would predict that after the assimilation, [ç] will 
trigger NASALIZATION. However, if [+ cont] is not in the PLACESUPRALAR constituent and the 
place features can transfer without bringing [+ cont] along with them, the prediction is 
that [ç] will not trigger NASALIZATION, because it would still lack a specification for [contin- 
uant], like [h]. The data in (12) show that the first assumption is correct: the new [ç] trig- 
gers NASALIZATION, and must therefore be [+cont]. This segment is now [—son, 
—cons, +spread, —voice] (which is what [h] contributes), as well as [+ cont, —lat, 
—nas, cor, —ant] (which is what {j] contributes). 


> Well-formedness rules must change the segment into [十 cons], because it now has a constriction causing 
friction. 
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(12) Underlying Northern Southern 
in-hiakja inhtakja igjekja ‘to hook in 
oan-hrera oanhiara oõçjerə ‘to listen to’ 


When Frisian [h] is transformed to [¢], it acquires place and manner features from 
another segment, but it does not itself lose any place or manner features, because it 
did not have any. Segments other than [h,fi,?] do of course have manner and place fea- 
tures. The location of manner features in the pLACE/SUPRALAR constituent in (10) predicts 
that segments can lose these features, while acquiring new ones from a neighbouring 
segment. Such a transfer should be possible independently of the laryngeal features. 
This can indeed be shown for the lateral consonants of Klamath. In addition to a 
voiced [1], Klamath has a glottalized [l] and a voiceless [|]. When appearing after [n], 
the lateral consonant transfers its lateral articulation to the preceding segment, but 
without its laryngeal features, which remain behind as [?] and [h], as shown in (13). 
This suggests that there is a constituent in the feature tree that contains manner but 
not the laryngeal features. Since the new segment is not [+ nas], this Klamath process 
suggests that the PLACESUPRALAR node of (10) contains [| +lat] (because it is transferred 
from the second to the first segment) as well as [ nas] (because it is deleted from the 
second segment). 


(13) [nf] — [I] 
[nl] — [ih] 


While it is clear that the manner features are inside the supralaryngeal constituent, it 
is also clear that the manner features cannot be under the piace node. In Hindi, as in 
many other languages, it is just the consonant’s place node that is transferred to a pre- 
ceding nasal, not its [一 nas] specification. From the node labelled pLACESUPRALAR in 
(10), we must therefore split off the PLACE node. Since there have been no reports of 
processes that transfer [+nas], [+cont], [lat], [+strid], [approx], en bloc, from 
one segment to the next, these features are not grouped in a manner constituent, but 
rather form separate terminal nodes dominated by SUpRALAR, as shown in (14). 


(14) ROOT 
cons 
son 
LAR SUPRALAR 
spread voice constr nas cont lat sind approx 
PLACE 
LABIAL COR DORSAL RAD 


dist round ant dist back high low tense 
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Single manner features can be shown to transfer independently to neighbouring 
segments. Processes that transfer [ + nas] are widely attested. (We have seen examples 
of this in American English and Frisian here and in earlier chapters.) Transfer of 
[cont] is much rarer. In American English, forms like [gibm] for given, [hi:dn] for 
heathen, (bidnas] for business and [wadnt] for wasn't occur (Bailey 1985: 63). No case 
has been made for the transfer of [+strid], [lat] or [approx]. There are many 
proposals that modify the structure of (14); however, this structure is one that is 
widely used, and may be seen as a ‘consensus tree (Broe 1992). 


11.5 SPREADING AND DELINKING 


The structure of the tree introduced in the previous section explains what must be 
made accessible to phonological rules, i.e. what the natural feature classes are. In this 
section, we will deal with the way features or subconstituents of the segment tree 
spread or delink. In our representation, segments are arranged from left to right, 
which is the usual metaphor for the order from ‘early’ to ‘late’. Along this time axis, the 
segment trees are arranged much in the way the records in an old-fashioned jukebox 
are arranged: a row of parallel disks, where a disk represents a segment. Each disk 
defines a plane in which some instantiation of the feature tree is to be found. If you 
were to take out a disk to look at its right or left face, you would see some version of 
(14). This is known as an end view. 

How can a node of some segment be dominated by the appropriate node of an adja- 
cent segment? Imagine two adjacent segments S1 and $2. Now mentally draw a line 
from the [nas] node of S1 to the [nas] node of $2, and a line from the supRALAR 
node of S1 to the supratar node of S2. The lines you have just drawn are, of course, 
tiers. All corresponding nodes form tiers in this way, so that we can talk about the 
‘nasal tier’, the ‘supralaryngeal tier’, the ‘root tier’, etc. Evidently, tiers are adjacent if no 
tier intervenes between them. This is the case with the node rrace and the immediately 
dominating suPRALARYNGEAL node, for instance: no node intervenes between them. Two 
adjacent tiers define a plane. 


Q94 


Draw two feature trees in pencil, next to each other. Draw all tiers. Now erase all pencil lines 


that are ‘behind’ the planes nearest to you. As you will realize, your picture shows three 
dimensions. 


We can now draw, in some plane, an association line from any node to the imme- 
diately dominating node in an adjacent segment. That is, from the [LABIAL] node of S1 
we can draw an association line to the PLACE node of $2, and from the PLACE node of S1 
to the suPRALARYNGEAL node of $2, and so on. This spreading of nodes in the feature tree 


is entirely comparable with the spreading of tones, or with the association of one 
vowel to two moras. 
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Q95 Can you draw an association line from the PLACE node of S1 to the roor node of S2? 
lf not, why not? à 


In Hindi PLACE ASSIMILATION, the PLACE node of the right-hand segment associates with 
the SUpRALARYNGEAL node of the left-hand segment. This is shown in (15) by the inter- 
rupted association line. Notice that there is something wrong now: what was an [n] is 
now specified for two sets of place features. What we need to do is remove the original 
PLACE constituent. This operation is known as delinking, and is symbolized by ‘=’. 
Delinked nodes are deleted. 


(15) ROOT 
+cons | ROOT 
+son +cons 
SUPRALAR SUPRALAR 
[+nas] i 
PLACE PLACE 


| | 


[ed [eA 


11.5.1 Writing rules 


It would be a very daunting prospect indeed if all rules were to be formulated by draw- 
ing these elaborate three-dimensional pictures. What we would need to do is draw just 
those elements in the configuration that are involved in the spreading and delinking, 
and make sure that the appropriate contextual conditions are expressed. We will do this 
in an adaptation of a notation proposed by Clements (1985). The adaptation consists in 
separating out two aspects in the rule. First, the Structural Change (SC) is identified: 
which node spreads to the appropriate dominating node in the adjacent segment (with 
or without subsequent delinking of the original association line in that segment)? This 
is shown by specifying two tiers, one of which immediately dominates the other, 
together with the corresponding nodes of two adjacent segments. To express the assimi- 
lation, it shows either an association towards the left - regressive assimilation - or 
one towards the right - progressive assimilation. In (16) these are the supralaryngeal 
tier and the pace tier, while the right-hand place node is specified as the spreading one. 
Second, the Structural Description (SD) specifies what phonological properties the two 
segments concerned must have for the change to go through. These properties (fea- 
tures, usually) are simply listed below the two segments concerned. In not specifying 
the SD in the tree itself, we express the fact that the tree was motivated on the basis 
of natural feature classes that are actually manipulated by phonological processes. 
By contrast, features in the SD merely condition those processes. 

The rule in (16) thus reads as follows: the PLACE node of any consonant spreads to 
(the supralaryngeal node of) a nasal on the left, while the PLACE node of the latter is 
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delinked. Observe that in our interpretation of Clements’ display, the upper and lower 
tier are always adjacent. The higher node label (here: supratar) is therefore redundant. 


(16) NASAL ASSIMILATION SUPRALAR 


SE: X 
X 
PLACE 


SD: [+nas] [+cons] 
Similarly, English coronar ASSIMILATION is given in (17): ‘Spread the coronal node 
(with all the features it dominates) to a noncontinuant coronal on the left’ 


(17) | CORONAL ASSIMILATION PLACE 


CORONAL 
SD: [cont ] 


We saw in Chapter 5 that distinctive features were phonetically non-abstract in the 
sense that they can be defined in terms of phonetic concepts. Would you say that 
natural feature classes are phonetically abstract? 


English has an optional rule assimilating [t,d,n] to the place of articulation of 
following velar and labial plosives and nasals, as shown in the data below. Formulate 


this English PLACE assimiLation in the display format. 


81m buk thin book 
Bek kap that cup 

gub bor "good boy 
tem maiz ten miles 
Set nart that night 


11.6 IMPLICATIONS OF UNDERSPECIFICATION 


In a number of ways, the proposal to represent the phonological content of segments 
as a feature tree has led to more insightful formulations of phonological rules. As 
explained in Chapter 5, two assumptions have been made here. One is that not all 


Implications of underspecification 4167 


segments have all nodes. Thus, laryngeal segments have no supralaryngeal node, and 
the place node has only those articulator nodes that positively specify the place of 
articulation of the segment, which implies, for instance, that [t] does not have a labial, 
dorsal or radical node (Sagey 1986, McCarthy 1988). Second, it has been assumed that 
predictable information is absent in underlying representation, and is supplied by 
default rules which apply at the end of the derivation. Below, a number of advantages 
of such underspecification in feature trees are briefly outlined. 


11.6.1 Default rules 


The decision to leave predictable features unspecified may have a simplifying effect on 
the formulation of rules that neutralize contrasts to some neutral, ‘default realization. Let 
us take (Low) German FINAL DEVOICING as an example. As we have seen in Chapter 5, the 
expected laryngeal feature specification for obstruents is [一 voice, — spread, —constr]. 
Let us assume that these values are filled in before the phonetic implementation rules 
start their work if no other value is specified, causing all obstruents without laryngeal 
specification to be voiceless, unglottalized and unaspirated. FINAL DEVOICING neutralizes the 
distinction between voiced and voiceless obstruents in the coda (cf. the uninflected and 
genitive forms of ‘variegated’, [bunt - buntas], and ‘union’, [bunt - bund3s]). If a default 
rule were assumed for German which supplies the value [一 voice] to obstruents in the 
coda, FINAL DEVOICING could consist of a rule delinking the laryngeal node in coda position. 
In (18), we formulate this version of the rule. 


(18) FINAL DEVOICING ROOT 


SC: 
LARYNGEAL 
SD: Coda 


A separate question is whether voiceless obstruents in a language like German can be 
left unspecified for the feature [一 voice] in the lexicon. Even though [+ voice] is con- 
trastive for obstruents in German, which has contrasts like [pain] ‘pain’ - [bain] ‘leg’, 
it would be possible to leave voiceless obstruents unspecified, while specifying voiced 
obstruents as [+voice]. The voiceless obstruents then only acquire their [一 voice] 
specification at the end of the derivation, or, if the feature needs to be referred to by 
some rule, as early as it is needed. This would be the assumption made in the theory 
of Radical Underspecification (Kiparsky 1982, 1993, Archangeli 1988), which, as its 
name suggests, goes one step further than the theory of Contrastive Underspecifica- 
tion, which was briefly discussed in Chapter 5. In Radical Underspecification, only one 
value of a feature is ever lexically specified. A survey of the issues is provided by Roca 
(1994: ch. 2) and Steriade (1995). In the next chapter, we will discuss a number of 


further instances in which underspecification would appear to facilitate the application 
of rules. 
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Q98 Two northern dialects of British English each have a postlexical assimilation rule 
affecting consonant sequences across word boundaries (Wells 1982, Kerswill 1987). 
Neither rule occurs in the standard accent (RP). 


1. Give SPE formulations of the two rules. 
2. _ Explain why it would not be possible to assume that a default rule supplies the feature 
[—voice] to English voiceless obstruents at the end of the derivation. 


Durham 


white blouse [tb] [tb] [db] 
wide shot - [df] t [df] 
ripe beans [pb] [pb] [bb] 
drab conditions [bk] [pk] [bk] 
black velvet [kv] [kv] [gv] 
five votes [w] [w] [w] 
rough boys [fb] [fb] [vo] 
this village [vy [sv] [zv] 

bad joke ， [dd 引 [dd3} [dd3] 
live performance [vp] [fp] [vp] 
Bradford [df] [tf] [df] 

that night [tn] [tn] fdn] 
at last [tl] [ti] [dl 

all true fit] fit] iy 


in Spain {ns} 


Q99 In Klamath, the underlying consonant pairs listed in the first column are pronounced 
as in the second column (cf. the data in (13); after Halle and Clements 1983). 
Demonstrate that two ordered rules are sufficient to derive the outputs correctly. 
Assume that voiced sonorant consonants are unspecified for [+ voice], and that 
a default rule supplies this feature to unspecified sonorants at the end of the 
derivation. Also assume that if a geminate arises as a result of a spreading operation, 
the two original feature trees merge under a single ROOT node. State in prose what 
these two rules should accomplish. Formulate the rules in autosegmental 
display notation. (Hint: Draw partial trees of a typical sequence like [nl’] before and 


after the change. Also draw a feature tree of [Il] after the merger under a single 
ROOT node.) 


Underlying 
nl 

n! 

nl’ 

I! 

li 
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Q100 In Brao, voiced, voiceless unaspirated, voiceless aspirated and preglottalized voiced 
plosives are contrastive in the onset, as illustrated by [dak] ‘walk (verb)’, [trus:] ‘fish’, 
[tun] ‘season’, [ ?du:r] ‘type of flute’ (the latter may vary with an implosive [dj). In the 
coda, the only plosives that can occur are voiceless unaspirated, as illustrated by 


[pe:t] ‘drink (verb)’ (Keller 1993). 


Provide the full laryngeal feature specifications for the four types of initial plosive. 
Which features could be supplied by default rules? 
What underlying laryngeal feature specification would you assume for the coda plosives? 


11.7 CONCLUSION 


If the representation of a segment consists of a self-contained list of features, it is 
impossible to express that, frequently, sequences of segments share particular fea- 
tures, a phenomenon known as assimilation. Also, the fact that such assimilations 
often involve specific groups of features, such as the features specifying the place of 
articulation, cannot be expressed. If, by contrast, the features are arranged in a con- 
stituent structure, or feature tree, with each constituent defining a tier, it does become 
possible for two adjacent segments to share the same feature. Also, because the feature 
tree can group features under a single constituent node, it will be possible for groups 
of features to be transferred, or spread, to an adjacent segment. 
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12.1 INTRODUCTION 


In this chapter, we continue our discussion of the feature tree. Two main topics will 
be dealt with. One concerns the opportunity that is afforded by the underspecification 
of features - and hence, the absence of particular nodes in the representations of 
segments - to characterize long-distance assimilations. Particularly relevant in this 
context are word-based restrictions on the distribution of certain features, such as is 
seen in Vowel Harmony, for instance. We will see that the solution to characterizing 
these distributional restrictions is reminiscent of the description of the lexical tone 
patterns of languages like Etung and Mende, discussed in Chapter 9. The second topic 
concerns the representation of complex segments. We will deal with the three types of 
complex segment recognized in Chapter 1. Also, we will return to the question of the 
representation of palatal and palatoalveolar segments, like [tf] and [j], and argue that 
they, too, are complex segments. 


12.2 LONG-DISTANCE ASSIMILATION 


It would be reasonable to expect that assimilations only occur between immediately 
adjacent segments. For instance, we would not expect to find a language in which the 
initial consonant of a word assimilates in place of articulation to the coda consonant 
of the second syllable. However, it is in fact not unusual to find cases in which an articu- 
lator node spreads to the dominating node of an apparently non-adjacent segment. 
If the No CROSSING CONSTRAINT introduced in Chapter 9 is valid in the segment tree, then 
this can mean only one thing: the intervening segment or segments must lack the 
node concerned. Long-distance processes of this kind have accordingly been used as 
evidence for underspecification. As an example, consider the Russian data in (1) 
(Kiparsky 1985). Sequences of obstruents in Russian, whether or not separated by 
a word boundary, agree in voicing with the rightmost one. This is shown in (1a). 
As shown in (1b), however, it does not appear to be the case that the obstruents must 
be strictly adjacent: a sonorant consonant may intervene. 
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(1) a. gorod-k-a gorotka ‘little town (genitive)’ 
mtsensk bil mtsenzg bil ‘it was Mtsensk 
b. iz mtsensk-a is mtsenska ‘from Mtsensk’ 
ot mzd-+ od mzdi ‘from the bribe’ 


Russian devoices final obstruents, which rule is assumed to apply before the assimila- 
tion of voice illustrated above. The spreading feature can accordingly be assumed to 
be [十 voice]. RUSSIAN VOICING ASSIMILATION is given in (2). The feature [十 voice] spreads 
left and any original specification for [voice] is lost through delinking. This spreading 
may take place more than once, to deal with sequences of obstruents. 


(2) RUSSIAN VOICING ASSIMILATION LARYNGEAL 


SC: > 
\ 
+ voice 


sp: [son] | yee | 


However, the spreading of [十 voice] from the rightmost obstruent to a non-adjacent 
obstruent can only proceed if sonorants do not have a specification for [+ voice] in 
the underlying representation. This is shown in (3), which shows that the assimilation 
goes right through an intervening sonorant consonant. The subsequent specification 
of the feature [十 voice] in sonorant segments is achieved by a default rule: Russian 
does not contrast voiced and voiceless sonorants.! 


(3) b LARYNGEAL 


SC: a 


+ voice 
SD: | —son | [ +son | fee | 


It has also been assumed that certain segments in principle refuse to bear certain 
nodes. As we have seen above, laryngeal segments are a case in point. Consequently, 
we can safely spread supralaryngeal nodes across [h,?], since these segments cannot 
bear such a node. The fact that nasalization processes frequently spread through 
laryngeal segments to vowels in adjacent syllables is often assumed to receive an 
explanation in this assumption. 


! In order to prevent (2) from applying across vowels, it must be stipulated that it does not cross a syllable node. 
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Another oft-cited case of long-distance spreading relies on the absence of a COR- 
ONAL node in noncoronal (labial and dorsal) segments. Sanskrit RETROFLEXION 
turns alveolar [n] into retroflex [n] after [t,s,n,t]. Retroflex consonants are con- 
trastively [— ant, —distr], as shown in (4) (Steriade 1987, Kenstowicz 1994a). 


We would at first sight expect the rule to be blocked by any consonant other than [n]. 
However, noncoronal consonants as well as vowels may freely intervene between context 
and focus, as shown in the first column of (5). By contrast, there may not be a coronal 
consonant other than [n] between target and focus, as shown in the second column of (5). 
(Underlyingly voiceless obstruents are shown as voiced between sonorant segments.) 


(5) Applies Blocked 
-na ‘presen? Is-na ‘seek’ mtd-na ‘be gracious’ 
-na ‘pass. partic? pu:t-na ‘fil? bhug-na ‘bend 
-aina ‘middle partic? put-aina fil? matj-aina ‘wipe 


ksubh-a:na ‘quake’ ksved-aina ‘hum’ 
-miana ‘middle partic? ktp-a-maina ‘lamen? krt-a-ma:na ‘cut’ 


The rule to accomplish this is given in (6). It relies crucially on the absence of a coro- 
nal node between the two segments. If there is a coronal node, the nasal cannot be 
reached, as the spreading association line would cross the association line between the 
coronal node and the piace node, as shown in (7). 


(6) RETROFLEXION 


PLACE 
7 
SC: Z 
CORONAL 
. f —ant —distr 
$D; B ] [ +nas | 
(1) 过 PLACE 
三 
wc. a 
ee 
CORONAL 


SD: [diste] [tee 
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Q101 In Sundanese, nasalization of vowels is predictable as a result of a phonological 
rule, NASALIZATION. Consider the following data (Robins 1957, Cohn 1990). 


一 


/Siar/ [siar] ‘seek’ 

/mawur/ [mãwur] ‘spread (Active) 
/panlin/ [panlin] ‘unrecognizable’ 
/najak/ [naijak] ‘sift (Active)’ 

/natur/ [natur] ‘arrange (Active)’ 
/kana/ [kana] - ‘for the purpose’ 
/ti?is/ [ti?is] ‘relax in a cool place’ 
Ini?is/ [ni?ig] ‘relax in a cool place (Active)’ 
/mahal/ [mahal] ‘expensive’ 

/canra/ [canra] ‘very nice weather’ 
/narahitan/ [narahitan] ‘wound (Active)’ 
/paho/ [pahd] ‘know (Active)’ 

/piar/ [iar] ‘seek (Active)’ 

1. Give a prose statement of NASALIZATION. 


2. Argue on the basis of these data that the feature tree for laryngeals does not include a 
supralaryngeal node. 


Q102 Stem-initial obstruents in Bakairi, a language whose syllable structure is (C)V, are 
realized as voiceless plosives, unless preceded by a vowel, when they are voiced 
(Wetzels 1997b). This is shown by [taka] ‘bow’, [te-daka-ge] ‘have a bow’. 
Astem-internal obstruent, however, can be voiced or voiceless, as shown by [itubi] 
‘skin’, [adapigo] ‘heat’. 


1. Assume that stem-initial obstruents are unspecified for [voice] in underlying 
representations. Which two default rules must be assumed? 


The distribution of stem-internal obstruents is not free. If stem-initial consonants are not 

counted, there can be only one voiceless obstruent in the stem, which must be either the first 

or the second obstruent (again, not counting any stem-initial consonant). Thus, while [taka] 

‘bow’, [adapiga] ‘heat’ and [ud>d>] ‘jaguar’ are fine, forms like *[itupi}, *{titupi], *[idebiko] are 

impossible. This state of affairs can be captured by assuming that Bakairi stems have a limited 

number of [+ voice] melodies, much in the way that Etung has a limited number of tone melodies. 

2. Assuming that [—son] segments are the [+ voice]-bearing segments, which [+ voice] 
melodies would you assume? Would the feature spread? 

3. How are [udod> ] and [adapig>] accounted for? 

4. Give the underlying forms of [taka] ‘bow’ and [tadakage] ‘have a bow’, indicating 
obstruents unspecified for voice with capital letters P, T, K. Illustrate the association of 
[+voice] melodies and the effects of the default rules. 


5. in [kon>pia] ‘little bird’, the voiceless [p] is the third consonant of the stem. Why isn’t this 
form ungrammatical? 
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12.2.1 Vowel harmony 


Vowel harmony can be seen as a subclass of long-distance assimilation. Many lan- 
guages exclude certain combinations of vowels in the word (for a survey, see van der 
Hulst and van de Weijer 1995). For instance, a Finnish word containing one of the back 
vowels [u,0,a] must not contain any of the front counterparts [y,ce,2]. Thus, possible 
words are [mite] ‘what’, [suomi] ‘Finland’, [talo] ‘house’, while *[tymo] and *[tume] 
are ill-formed. Since the vowels must apparently agree for the feature [+ back], Finnish 
represents a case of Back Harmony. Languages may also display Round Harmony, 
Height Harmony and ATR-Harmony. An example of the latter is Akan, in which the four 

_[+ATR] vowels [i,e,0,u] must not co-occur with any of their [— ATR] counterparts 
[1,€,9,U]. This situation is reminiscent of the restriction on tonal patterns in Ewe and 
Etung discussed in Chapter 9. Specifically, it would be reasonable to assume that, 
underlyingly, backness in Finnish and ATR-ness in Akan are properties of the word, 
rather than of the individual vowels. Thus, underlyingly, the vowels of a Finnish word 
like [pysæhtyæ] ‘to stop’ would be unspecified for [back]. Instead, the word appears 
in the lexicon with a ‘floating’ feature [一 back] (8a), which associates to the DORSAL 
nodes of all the vowels, as shown in (8b), which is equivalent to (8c). (In (8a,b) the 
underspecification for [back] is indicated by the capital letters. Thus, [A] could be 
[æ] or [a], etc.) Notice that there is little point in assuming a particular direction of 
association, as there is only one feature to be associated. The important advantage of 
the lexical representation in (8a) is that the mutual exclusion of [+ back] and [—back] 
vowels in the same word is naturally accounted for. 


(8) a. pUsAhtUA b. pUsAhtUA c. [pysahtyz] 
[一 back] NA 
[—back] 


The vowel systems of languages with vowel harmony frequently have one or more 
vowels which do not contrast for the harmony feature of the language. Thus, in addi- 
tion to the two sets of four vowels mentioned above, Akan has a [—ATR] [a], for 
which there is no [+ATR] counterpart *[a]. Similarly, Finnish has the two [— back] 
vowels [i,e], for which there are no [十 back] counterparts *(w,¥]. Depending on the 
language, such noncontrasting vowels behave in either of two ways. They may be 
opaque, in which case they stop the spreading of the feature with the opposite value 
and, moreover, impose their value on any following vowels. Alternatively, they are 
transparent, in which case they allow the spreading of the harmony feature to go 
right through them. Opaque vowels thus block and impose harmony, while transpar- 
ent vowels neither undergo nor impose it. Akan [a] is an opaque vowel, while Finnish 
[i,e] are transparent vowels. 

As shown by Clements (1981), the autosegmental model has no problem in charac- 
terizing the behaviour of opaque vowels. If we assume that the floating feature only 
spreads to vowels that are unspecified for the harmony feature, we predict that the 
spreading is stopped by any vowel that already has a specification. Akan words are 
made up of a root, which is marked either [+ ATR] or [—ATR], plus a number of 
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prefixes and suffixes. If the word contains only nonlow vowels, all the vowels agree 
with the feature value of the root, as illustrated by [e-bu-o] ‘nest’, which contains the 
[+ATR] root [bu], and [e-bu-9] ‘stone’, which contains the [—ATR] root [bu]. 
However, [a] may occur in combination with either [+ ATR] or [—ATR] vowels, as 
shown by the disyllabic roots [pirra] ‘to sweep’, [jart] ‘to be sick’, [bisa] ‘to ask’ and 
[kari] ‘to weigh’, where the latter two are ‘disharmonic’ (Clements 1981 ). Their lexical 
representations are given in (9). 


(9) a. plrA b. jArl c. bIsA d. kArl 
N 


Y ai, ts 
[一 ATR] [一 ATR] [+ATR][—ATR] [—ATR][+ATR] 


Disharmonic roots will have prefixes and suffixes that surface with opposite values for 
[ATR]: the floating [+ ATR] feature and the associated feature of [a] will each spread 
to their ‘half’ of the word, as shown in (10), where (10a,b) are equivalent to (10c,d), 
respectively. 


(10) a  O-blsA-I b. O-kArl-1 
bo AZ a 
[+ATR][—ATR] [一 ATRJ[+ATR] 


c. [o-bisa-1] d. [2-kari-i] 


Q103 Akan does not have words with [+ATR] vowels on both sides of [a]. Does the 
above account explain this? 


By contrast, the behaviour of transparent vowels has been problematic for the autoseg- 
mental model. It would be reasonable to want to represent transparent vowels as 
underspecified for the harmony feature of the language, so that they can let the spread- 
ing feature through. Recall, however, that the solution to the problem of vowel harmony 
was to leave the harmonizing vowels unspecified for the harmonizing feature. So how 
can a Back Harmony rule, for instance, distinguish between harmonizing and trans- 
parent vowels during its search for empty norsat nodes in vowels? It might be thought 
that the solution lies in underspecifying transparent vowels even further, and leaving 
the entire porsat node out. This might work in the case of languages that have only one 
transparent vowel, but not in the case of Finnish, which has two transparent vowels, [i] 
and [e], which contrast for a feature which in the consensus model is present on the 
DORSAL node: [ +high]. One solution that has been proposed is to appeal to a constraint 
banning the ill-formed vowels concerned. For Finnish, the constraint would be (11), 
according to which back unrounded vowels like [w] and [x] are disallowed. The idea is 
that rules are prevented from producing the structure concerned, but that their work- 
ing is otherwise unimpeded. This case may serve as a further argument for abandon- 
ing rule-based descriptions in favour of constraint-based ones. 


? The vowel [a] has a number of allophones, which we have not indicated. 
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(11) * [— cons] 
PLACE 
LAB COR 
fend [+back] 


Q104 The Finnish word game Siansaksa (‘Pig German’) produces the forms in the 
second column for the Finnish forms in the first (Vago 1988). 


saksalaisia haety:tet:i:n hæksælæisiæ satu:tet:i:n 
‘the Germans were attacked’ 

tyk:ae:n urheilusta uk:a:n tyrheilystæ 

‘| like sports’ 

otsansa hies:e hitsansa oesia 

‘in the sweat of his brow’ 


pite: kalasta kata: pilasta 
‘likes fish’ 


Give an analysis of Siansaksa. 

Does your analysis account for the fact that the first word in the game form in (a) is not 
*Theselaisie]? 

Does your analysis account for the fact that the second form in the game form in (a) is not 
*[saty:tet:i:n]? 

Does your analysis account for the fact that in (c) the first word in the game form is not 
*[hitsænsæ]? 


12.3 COMPLEX SEGMENTS 


Unlike simplex segments, complex segments have more than one specification either 
for place of articulation or a manner feature. Three types of complex segment can be 
distinguished. 


1. Complex place segments. A complex place segment has more than one articu- 
lator node, and more than one articulator therefore participates in realizing the 
constriction specified by the manner features in the root. That is,a double articul- 
ation is produced. Examples are [gb], [kp] and [nm], all of which are labial- 
dorsal. English [w] is a labial-dorsal approximant. Labial-coronal [pt] occurs in 
Bura and Margi. In section 12.3.2 we will see that prepalatal and palatoalveolar 
consonants like [j], [tf] and [c] are coronal-dorsal. In (12), an example is given 
of a segment with a labial as well as a coronal node. Of course, above the PLACE 
node the segment looks just like a simplex segment. This is an end-view: the 
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two articulator nodes lie in the segment plane, and are thus phonologically 
simultaneous. 


(12) PLACE 


LABIAL CORONAL 


2. Manner-contour segments. Manner-contour segments have a sequence of 
differently valued occurrences of the same manner feature. For example, 
prenasalized stops like [mb, nd, ng] are [十 nas] as well as [—nas]. Since 
[+nasal] defines a feature-plane, this can only be represented by arranging 
the two specifications in sequence, and a side-view of such a segment would 
look like (13). 


(13) ROOT 


<— SUPRALAR 


[+nas] [一 nas] 


A second type of manner-contour segment has been claimed by Lombardi 
(1990), among others, for affricates, like German [ts] as in [tsu:] ‘closed’ or 
English [tf] as in [tfu:] chew. In this view, affricates have a sequence of 
[—cont][+cont], instead of being simplex segments that are characterized by 
[+strid], to distinguish them from [—strid] plosives. The most important 
argument for the view that affricates are complex is that they are pronounced 
as complex segments, a closure followed by a narrowing causing friction. A 
second argument might be that affricates may have fricative portions that 
would be characterized as [— strid] if the friction constituted a segment by 
itself. In such cases, it may not be clear what the value of [strid] should be. 
However, there is also evidence that affricates function as strident plosives and 
are simply [一 cont]. If they were (also) specified as [+cont], they would be 
expected to pattern with fricatives, but this appears not to be the case. For 
instance, the Polish fricative [s] is assimilated to [J] by a following [tf], as in 
[ojtje] ‘thistle +SG,LOCATIVE’, from [ostfe], but the affricate [ts] is unaf- 
fected, as in [otstfe] ‘vinager ++SG,LOCATIVE’, *[otftfe] (Rubach 1984, Kim 
2001; see also Q41). Of course, under either analysis, affricates are single seg- 


ments, and we will leave the question whether they are simplex or complex 
undecided. 
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3. Secondary articulations. Consonants with a secondary articulation have two 
place specifications: one to indicate the location of the manner of articulation, 
and one to indicate a simultaneous vocalic articulation. That is, while the seg- 
ment is [+cons], and can have any configuration of manner features, it has an 
additional component specifying a vowel-like gesture of either the lips or the 
tongue body. A secondary articulation like labialization can combine with a 
labial, a coronal or a dorsal segment. The labialized segments [p"], [m ], [t"], 
[n"], [k"] and [ņ"] are only some of the contrastively labialized segments 
of Nambakaengo. Similar freedom of combination can be shown to exist for 
palatalization, velarization and pharyngealization. As explained in chapters 3 
and 9, there is however a strong tendency not to combine identical place speci- 
fications. While segments like [x"], [x"] are relatively common, labialized 
labials are rare. (Velarized velars have not been reported; indeed, it is not clear 
how such segments could be different from plain velars.) 

Clearly, the representation of secondary-articulation segments will involve 
the addition of a place node, and in having more than one place node, these seg- 
ments resemble complex-place segments. However, unlike what we find to be 
the case in complex-place segments, the two place specifications of secondary- 
articulation segments differ in manner of articulation. To indicate which of the 
place features specifies the secondary articulation and which specifies the con- 
sonantal place, it is customary to mark either the consonantal place or the 
vocalic place with some diacritic, as has been done in (14), which shows a par- 
tial representation of [pi]. In Clements and Hume (1995), a less arbitrary solu- 
tion to this problem is offered. In this proposal, the feature tree includes a 
vocalic place node in addition to a consonantal place node, each of which dom- 
inates the articulator nodes Labial, Coronal, Dorsal and Radical. A feature like 
Coronal is interpreted as a front articulation in the case of vowels, and as a cor- 
onal pronunciation for consonants, while Dorsal is used to characterize back 
vowels or dorsal consonants. A separate node is used to specify the degree of 
opening of vowels. A further discussion of this important proposal is beyond 
the scope of this chapter (cf. Clements and Hume 1995, Kenstowicz 1994a). 


(14) SUPRALAR 


te Om 


[-cont] PLACE 


LABIAL DORSAL secondary 


[-back] 


The two place nodes in a segment with secondary articulation are not sequenced in 
time. Although in the IPA symbols the superscripts indicating labialization, velarization, 
etc. conventionally appear to the right of the consonant symbol, the two components of 


Complex segments 179 


a secondary articulation segment are phonologically simultaneous. That is, a side-view 
would show a straight line. 


Q105 There have been no reports of complex segments like [px] or [dn]. How do the 
representations assumed above account for this? 


12.3.1 Evidence for complex segments 


Complex segments are single segments, and as such distinct from sequences of seg- 
ments in having a single root node, a representational difference which we would expect 
to see reflected in the behaviour of complex segments. For instance, a manner-contour 
segment like [mb] should not be treated as if it were a sequence of [m] and [b], while a 
palatalized [ should not be treated as if it was [t] followed by [j]. An obvious place to 
look for such single-segment behaviour is the way they are incorporated in syllable 
structure. At first sight, we might be tempted always to regard phonetic sequences like 
[mb] or [nd] as sequences of phonological segments. However, the fact that in Bakwiri, 
for example, these are the only phonetic sequences ever to appear prevocalically is a 
strong indication that [mb], [nd], [ng] are single segments in this language, as this analy- 
sis will allow us to restrict its syllable structure to CV: it explains why no other combina- 
tions of consonants ever occur. Under the alternative analysis with CC onsets, an 
explanation would have to be found for the peculiar restriction that only sequences of 
nasal plus homorganic plosive can fill CC onsets, even though clusters like [st-] or [pj-] 
are fairly common in other languages. Another conceivable alternative is equally unat- 
tractive. If we assumed that the syllable structure was CV(C), [limba] would be syllabi- 
fied [lim.ba] rather than [li.mba], while a word like [mbila] would be [m.bi.la], with a 
syllabic [m]. This analysis can be empirically excluded on the basis of the word game that 
appeared in the question on page 148, in which the syllables of disyllabic words are 
inverted (Hombert 1973, 1986), as shown in (15). Clearly, if the language were to allow 
codas we would expect the syllable structure of the word for‘rice’to be *[kén.di], and the 
inverted form to be *[dikOn]. This, however, is not the correct game form. 


(15) Word Game form 


mòkò kòmò ‘plantain’ 
lowa wal ‘excrement’ 
kwéli likwe ‘falling’ 
kondi ndikò ‘rice’ 


Q106 in Tswana, [I] loses its lateral pronunciation before [i], in which context it is 
pronounced [d] (Whitley 1978). Thus, [I] occurs in [lefifi] ‘darkness’, [lòlémè] ‘tongue’ 
and [xòpálà] ‘marry’, while [d] appears in [badisé] ‘herd’, [mosédi], ‘woman’ and 
[p6di] ‘goat’. Tswana has a voiceless lateral coronal affricate [tt], which occurs 


before all vowels, including [i], as illustrated by [xòttà] ‘come’, [ttilé] ‘we came’. 
Argue on the basis of the rule of DELATERIZATION that [tt] is a single segment in 
Tswana. 
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Q107 In Ewe, verb stems reduplicate to form a present participle, noun or adjective. 
Determine what segments of the verb stem are reduplicated. Why do these data 
(from Sagey 1986: 86) demonstrate that Ewe has compound-place segments and 
an affricate? 


Gloss Derivation Gloss 
fo ‘to beat’ fofo ‘beating’ 
zo ‘to walk’ z0z0 ‘walking’ 
aha + no ‘liquor + drink’ ahanono ‘liquor-drinking’ 
fja ‘to burn’ fafja ‘burning’ 
bja ‘to ask’ babja ‘asking’ 


fle ‘to buy’ feflee ‘buying’ 

pra ‘to rave’ paprala ‘a raver’ 

kplo ‘to lead’ kpokplo ‘leading’ 

avo + sja ‘cloth+to dry’ avosasja ‘cloth-drying’ 

tsi ‘to grow’ tsitsi ‘grown up’ 

gbla ‘to exert oneself’ gbagblam ‘exerting oneself’ 


12.3.2 The representation of palatals and palatoalveolars 


Since in the consensus model (Broe 1992), roundness requires a labial articulator 
node and tongue body specification requires a dorsal articulator node, round vowels 
have two articulator nodes. Segments like [y,8,u] are therefore complex, i.e. labial- 
dorsal. Thus, the place specification of the segment [u] is identical to that of the labial- 
dorsal approximant [w], as well as to that of the labial-dorsal plosive [gb]. But while 
the assumptions made so far correctly characterize [u] and [w] as a natural class, they 
fail to characterize [i] and [j] as having the same place of articulation. The assump- 
tion that [i,j] and [u,w] form natural classes is supported by processes in a number of 
languages which have alternations between [i] and [j] as well as between [u] and [w]. 
In such cases, [i,u] appear in the syllable peak, while [j,w] appear elsewhere. Our 
treatment so far has assigned the feature [coronat] to palatoalveolars and 
(pre)palatals like [j], but [porsat] to vowels, including front vowels like [i]. Not only 
does this make it impossible to describe alternations between [i] and [j] as involving 
a difference in position in syllable structure, it also fails to account for a number of 
effects [i] and [j] have in common (Bhat 1978, Lahiri and Evers 1991). 


1. The segment [j] triggers fronting of velars, just like [i] (and possibly other 
front vowels). For instance, French [kjosk] ‘kiosk’ has the same [k] as does [ki] 
‘who’. The fronting is easily explained for [i], which is [porsaL, — back]. But why 
should [j], a coronal, have the same effect? 


2. In many languages, [k] before [i] has changed into prepalatal (i.e. [coronat]) 
[c], and from there into palato-alveolar [tf]. Taking this route, Latin 
[ki:wi'ta:tem] and [‘kirka:] ended up as [tfit'ta] ‘city’ and ['tfirka] ‘approxi- 


mately’ in Italian. If [i] is [porsat], how can it affect a dorsal consonant so as to 
become a coronal consonant? 
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3. Many languages, of which Japanese is an example, have processes that change 
[t,s] into [tf,f] before [i], as in [sitimi], pronounced [fit fimi], ‘pepper’. [tf,/] 
are coronal. If [i] is [porsat], then why don’t we get dorsal [Sikimi] instead? 


Beginning with Keating (1987), many researchers have analysed segments like [j] as 
compound-place segments with both a coronal and a dorsal articulator. In this view, 
[j,i] both have the structure as in (16). With Jacobs (1989) and Jacobs and van de 
Weijer (1992), we assume that the palatoalveolars [f,3,tJ,d3] are also coronal-dorsal. 
This view is supported by articulatory data, in the sense that all these segments 
involve articulations between the forward part of the tongue and a section of the 
palate stretching some 2cm back from the alveolar ridge. These data have been 
obtained with the help of X-ray pictures with side-views of the tongue, as well as with 
the help of electropalatography, a technique for recording contact areas of the roof of 
the mouth (Recasens et al. 1995). 


(16) ROOT 
—cons 
+son 
ee ae, 
LAR SUPRALAR 
十 voice ee 
nas 4+ cont 
=lat PLACE 
CORONAL DORSAL 
—ant 十 dist 一 back +high 一 low 


This feature analysis easily accounts for all three observations above. First, French 
VELAR FRONTING of [k] to [k] in the environment of [i,j] can be accounted for as spread- 
ing of the [back] node of [j] and front vowels to the dorsal node of the velar. This is 
shown in (17), where the [—back] feature of [i,j] will associate with the dorsal node 
of [k]. (Depending on the language, the rule may have to be constrained to prevent 
other front vowels from triggering it.) 


(17) VELAR FRONTING DORSAL 


+ back 


SD: [-back ] 
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This analysis also accounts for the occurrence of [¢] instead of velar [x] in the envi- 
ronment of [j] and front vowels, as in Greek ['çilia] ‘thousand’ versus [Xo'ros] ‘dance’. 
That is, the fronting of [x] to [ç] amounts to the replacement of [十 back] with 
[一 back] in the dorsal fricative. 

The assimilation of [k] to [c] or [tf] can be described as the spreading of 
the coronal-dorsal node of [j,i] to the supraLar node of the velar, as shown 
in (18), where the left-hand segment, [k,g], receives the PLACE node of the follow- 
ing [i,j]. 


(18) a SUPRALAR 


PLACE 


— nas ig 
—cont back 


Sp: ied Es} 
And, to complete the story, (19) describes the assimilation of [t,s] to [tf,f] as the 


spreading of the PLACE node from [i,j] to the consonant. This analysis predicts that the 
coronal specification of [t,d,s,z] becomes [— ant, + distr], which is correct. 


(19) SUPRALAR 
N 
N 
SC: N 
PLACE 

COR heer 
SD i gol | +high 
-back 


These spreading operations only account for the place assimilation. The manner 
features will in many cases be supplied. For instance, if the product of (18) is [c], noth- 


ing needs to be done, but if it is [tf], [+strid] needs to be added (cf. Kim 
2001). 


>The [ç] is classed as a complex coronal-dorsal by Keating (1987). In addition to making it impossible for [f] to 
be coronal-dorsal, this characterization does not do justice to the more retracted articulatory contact for [ç] 
than for the palatoalveolars and [j]. Also, the implied suggestion that the fronting of [k] is a different process 
from the fronting of [x] has not, as far as we know, been confirmed. 
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12.3.3 Distinguishing among palatoalveolars 


The coronal-dorsal representation of [f,3,tf,d3] would appear to offer a satisfactory 
description of these consonants, which are typically pronounced with the tongue tip 
raised towards, or articulating with, the alveolar ridge, and a simultaneous raising of 
the tongue blade and forward part of the front towards the postalveolar area of the hard 
palate. The coronal node is responsible for specifying the first type of articulation, 
while the dorsal node is responsible for the second aspect. Of course, the coronal node 
can dominate a number of specifications for the features [ant] and [+distr]. Even 
though different palatoalveolar consonants rarely contrast in the same language, the 
implicit prediction that there are different types of palatoalveolar consonants is cer- 
tainly correct. First, a [—ant,+ distr] articulation will have the blade articulating with 
the postalveolar region, while allowing the tongue tip to be behind the lower teeth. This 
is typically the tongue position for [j] and [i]. In the case of [十 cons] segments, this 
type is sometimes referred to as ‘prepalatal’ or ‘alveopalatal’. The obstruents are some- 
times symbolized [¢,z,t¢,dz], particularly in the case of languages in which they con- 
trast with other palatoalveolars. In Serbocroat, for instance, [t¢] contrasts with [tf], for 
which the coronal contact is postalveolar or retroflex, i.e. [—ant,—distr], as shown by 
[t fair] ‘enchantment’ - [teaxr] ‘profit’. (Both coronal-dorsal affricates are distinct from 
the coronal affricate [ts], as shown by ['tsiiriti] ‘to leak — [tgiiriti] ‘to blow (of wind)’ 
and [tseh] ‘guild, union’ ~ [tJéh] ‘Czech mar.) Similarly, Polish contrasts retroflex [s, 
ts] with alveopalatal [¢, tẹ], as in prosze [proge] ‘please’, czas [tgas] ‘time versus prosie 
[proce] ‘piglet’ and ciasny [tçasni] ‘tight’. 

English [f,3,tf,d3] are [++ ant, — distr]: in addition to the raised front and/or blade, 
there is an apical articulation with the alveolar ridge. This representation accounts for 
an apparent asymmetry in the application of coronal ASSIMILATION in English. Recall 
that this process causes coronal [t,d,n,l] to adopt the place of articulation of a follow- 
ing coronal consonant, so that the [t] of that is dental in that thing, alveolar in that tin 
and postalveolar in that trip (cf. p. 159). The apparent asymmetry is that before pala- 
toalveolars the place of articulation of [t,d,n,l] remains unaffected. Even though the 
forward part of the front may show some anticipatory raising towards the hard palate, 
[t,d,n,l] retain their contact between the tip and the alveolar ridge, just as they do 
when a vowel follows, so that the location of the crown during [t] in that egg is the 
same as that during [t] in that chore, and the [n] in keen officer is no different from the 
[n] in keen judge.* Our analysis readily accounts for these data: [t f,d3] have a coronal 
node with [+ ant, —distr], which explains the articulatory action of the tongue tip, in 
addition to a dorsal node with [一 back], which explains the raising of the front of the 
tongue. Since it is only the coronal node that spreads to a preceding [t,d,n,]], the effect 
on their place of articulation is entirely vacuous, these consonants already being 
[ + ant, —distr]. By contrast, if we were to analyse palatoalveolars as simplex coronal 
segments, as was done in Chapter 5, they would have to be [—ant, + distr], which fea- 
tural characterization would be necessary to explain the largely postalveolar place of 


“Clements (1985) notes that the articulation of [t,d,n], but not of [I], changes before palatoalveolars. At least in 
British English, we believe there is no such distinction. 
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Table 12.1 
Feature analysis of coronal, coronal-dorsal and dorsal consonants stands for the presence 
of a unary feature 


Coronal Coronal-dorsal Dorsal 
Apical [t,s] Prepalatal [i,j,y,f] Eng. [f,3,t/,3] kg kx] 
Coronal V y Vy 
Anterior + = T 
Distributed ”一 + = 
Dorsal V V qpa 
Back 一 = 


articulation. This would incorrectly predict that coronar ASSIMILATION causes a consider- 
able retraction of the place of articulation of [t,d,n,l] before palatoalveolars. 

We summarize our feature analysis of coronal, coronal-dorsal and dorsal conson- 
ants in Table 12.1. 


Q108 Tahitan has the following system of oral coronal [+ cons] segments (Shaw 1991): 


ts’ 
5 


6 3 


The dental, aiveolar and palatoalveolar consonants in the last three columns are involved in a 
consonant harmony process of CORONAL HARMONY. Within the word, the rightmost consonant in 
this group determines the place of articulation of all other consonants in the same group. This 
is illustrated with [s] ‘first person singular subject’, and with [Oi] ‘first person dual subject’ in 
the data below. The (a) examples show these suffixes in their unassimilated form, while the (b) 
examples show the effect of coRONAL HARMONY. The consonants in the first two columns above 
are transparent to CORONAL HARMONY: they neither trigger nor block the process. This is 
illustrated by the (c) examples. 

Underlying [s] Underlying [0i] 
a. esk’ar . ‘Tm gutting fish’ deQigitt ‘we threw it’ 

esdan ‘f'm drinking’ 9it9adi ‘we ate it’ 

sesret ‘I'm sleepy’ naQiba:tt ‘we hung it 

hudiftfa ‘I love them’ ifitfott ‘we blew it up’ 

teneftfu:f ‘I'm folding it’ ufidze ‘we are called’ 

£@du:0 4whipped him’ xaşi:dets ‘we plucked it’ 

meQe8eq Tm wearing’ desidzel ‘we shouted’ 

edede@du:8 ‘I whipped myself’ —desit’as ‘we are walking’ 

jaftt'etf ‘I splashed it’ me?efit’otf ‘we are breast-feeding’ 
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How can we tell from these data that the two suffixes have different underlying forms? 
What are the three place specifications of the consonants involved in CORONAL HARMONY? 
Why is it impossible to view CORONAL HARMONY as the leftward spreading of a [CoRoNAL] 


node to consonants that are unspecified for [coRoNAL]? 

Is it possible to view CORONAL HARMONY as a rule that spreads a [coRONAL] node to 
consonants that have a [coronaL] node, while skipping consonants that do not? Motivate 
your answer. 


12.4 CONCLUSION 


Many languages place restriction on the combination of different values of the same 
phonological features in the word. Such word-based distributional restrictions 
include vowel harmony and consonant harmony, but also word-based distributional 
patterns of nasalization and obstruent voicing occur. The autosegmentalization of fea- 
tures, together with the underspecification of certain features in certain segments, 
allows us to describe these patterns with the help of many-to-one associations, a more 
insightful solution than is possible in a listed representation of features, as was usual 
in the linear theory of SPE. To account for transparent segments, however, a rule- 
based description will need to fall back on constraints. This was seen as an argument 
for abandoning the concept of rules that literally change one representation into 
another, and for adopting a theory like OT, which relies on constraints that simply tell 
us what surface representations must look like. 

We also saw that the feature tree allows for a natural representation of complex seg- 
ments. It was shown that, in addition to obviously complex segments like [mb], [kp] 
and [t™], palatal and palatoalveolar consonants can fruitfully be interpreted as (com- 
plex) CORONAL-poRSAL segments, a type of representation that makes it possible to 
describe naturally various place assimilations involving palatal or palato-alveolar 
articulations. 


Stress and feet 


13.1_INTRODUCTION 


This chapter discusses two conceptions in which word stress has been represented in 
phonological theory, first, by means of a feature [ 士 stress] in a linear model and, sec- 
ond, by means of a structural position in a nonlinear model. After a brief description 
of the phonological nature of stress and of the notion of foot in section 13.2, section 
13.3 discusses the way in which stress was accounted for in linear phonology. In sec- 
tion 13.4 we will provide an overview of the nonlinear metrical theory proposed by 
Hayes (1981) and show that it allows for a more principled way of accounting for the 
different stress patterns of the world’s languages. 


13.2 THE PHONOLOGICAL NATURE OF STRESS 


It used to be thought that stress was realized by some phonetic parameter, much in the 
way that [coronat] is realized by a raising of the crown of the tongue. While at first, 
loudness was taken to be the relevant phonetic parameter, experiments by Fry (1955; 
1958) showed that a number of different parameters were involved in creating the 
impression of stress in English, specifically duration, pitch variation and vowel qual- 
ity. Beginning with Liberman and Prince (1977), it was realized that stress was not a 
phonological feature that was given some content by the phonetic implementation 
rules, as assumed in the linear theory of SPE, but a structural position. The structural 
position is the foot, which is a phonological constituent above the syllable and below 
the word. It is typically characterized by one strong and one weak syllable. As explained 
in Chapter 1, one of the feet in a word is the strong foot, and its strong syllable (its 
head) will typically be more prominent than the other feet, for instance because it 
attracts intonational tone or extra duration. This syllable has the main stress of the 
word, while the heads of the other feet are said to have secondary stress. 

The question of how stress is realized depends on the question how a language 
chooses to use the structural position represented by the foot. English is a language 
that makes a very clear distinction between stressed and unstressed syllables. For one 
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thing, only a subset of the vowels is allowed in the weak position of the foot: [9,1,i,U], 
as in the final syllables of villa, market, folly and fellow. Second, weak syllables are 
noticeably shorter than strong syllables, particularly in non-word-final position. 
Thus, in Aberdeen, which consists of the feet [æbə] and [di:n], the middle syllable is 
very much shorter than either the first or the third, both of which occur in the strong 
position of a foot. Third, the head of the foot is used as the association site for inton- 
ational tones. In a citation pronunciation, an H*L melody is associated with the strong foot, 
and an H* or H*L melody with a preceding weak foot. This is shown in (1a,b) versus (1c). 


oy ey oe 


a. a b. i asi c. Aberdéen 


H*L H*L H MPI 


Compared with the situation in English, Japanese feet are not at all phonetically 
salient (Poser 1990). Importantly, a lexical HL melody can occur on any syllable in the 
Japanese word, and thus does not have to go to the head of a foot. In brief, languages 
vary in the way they employ the structural position represented by the foot and, as a 
result, in the extent to which we perceive the head of the foot as more prominent than 
the weak syllables; but what they crucially share is the set of phonological principles 
that determine where the feet are located in a word. 


13.2.1 Metrical feet and feet in poetry 


The notion of the foot as used in metrical theory is similar, but not identical, to the 
foot known from metrical poetry. The essential aspect of the foot both in poetry and 
in stress theory is that it represents a kind of grouping of syllables into constituents. 
To illustrate this more clearly, let us consider Latin. Latin poetry was based on a quan- 
titative rhythm (alternating heavy and light syllables). As explained in Chapter 10, a 
syllable is light if it has a short vowel and is not closed, while a syllable that ends in a 
consonant or has a long vowel is heavy. One of the best known meters is the dactylic 
hexameter, a verse line which consists of six dactylic feet. A dactylic foot is a group of 
syllables, possibly separated by word boundaries, that consists of one heavy syllable 
followed by two light syllables. Alternatively, two heavy syllables may constitute one 
foot. Consider the opening verses of Virgil’s Aeneid in (2), where we have given the 
metrical scansions for the dactylic hexameter above the verse lines. Word stress has 
been indicated with an acute accent. 


(2) - UU/- UU/- -/- -/- UU/-— 
Arma virúmque cdno Tróiae qui primus ab éris 
I sing about the man and his weapons who first from the shores of Troy 
SON i= eal Aa eet fama 
Itdliam, fáto prófugus, Lavinia  vénit 
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to Italy, driven by fate, to the shores of Lavinia came 
— UU 
Littora... 


In English poetry the rhythm is based on stress, that is, on alternating stressed and 
unstressed syllables. The most common meter is the iambic pentameter. Each verse 
line consists of five iambic feet. An iambic foot is a group of two syllables, again pos- 
sibly separated by a word boundary, of which the first is weak (W) and the second 
strong (S). By way of optional variation, the first foot may be SW. Consider the open- 
ing lines of one of Shakespeare's famous sonnets in (3). 


(3) W S/WS/ W S/WS/ WS 
Shall I compare thee to a summer's day? 
W S/W S/WS/IW S/IWS 
Thou art more lovely and more temperate. 
W S IWS JWSIW SH WS 
Rough winds do shake the darling buds of May... 


What we observe in (2) and (3) is that the S(trong) and W(eak) positions of feet in 
poetry need not coincide with stressed and stressless syllables, respectively. In fact, 
meters as well as languages vary in the extent to which they enforce such a one-to-one 
matching. For instance, as in (2), the word stress of cano is a weak position and the 
unstressed syllable [no:] is a strong position. In Latin this was quite common, whereas 
in English similar mismatches, as in the case of theeto, are allowed only in limited cir- 
cumstances (see e.g. Kiparsky 1977, Hayes 1989). 

In poetry the feet or the meter are used for esthetic reasons. In metrical stress the- 
ory, feet serve no such purpose, but represent the rhythmic structure of the word, in 
other words, its stress pattern. As such, they are intended to provide the descriptive 
units necessary to account for the stress patterns that may exist in the world’s lan- 
guages. Metrical theory is thus primarily concerned with the properties that govern 
the determination of the location of the stressed syllables in a word, whose phonetic 
realization may vary from language to language. Metrical theory is not one single the- 
ory, but rather consists of a number of alternative proposals. In the next section we 
will discuss the way in which word stress has been represented in linear phonology. 


13.3 STRESS AS AN ABSOLUTE PROPERTY OF SEGMENTS: LINEAR 
PHONOLOGY 


As an example of how stress was accounted for in the linear phonological theory of 
SPE, let us continue our discussion of Latin. In Latin, monosyllabic words are stressed 
on the only syllable available. In words that have more than one syllable, the main stress 
is never on the final syllable. Thus, in bisyllabic words stress is always realized on the 
first of the two syllables. In words of three syllables or more, stress placement becomes 
slightly more complicated. Main stress can either be on the prefinal (penultimate) or 
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preprefinal (antepenultimate) syllable. In these cases the location of stress depends on 
the weight of the prefinal syllable. The minimal pairs in (4) illustrate that vowel length 
in Classical Latin is distinctive. 


(4) populus ‘people po:pulus ‘poplar tree’ 
malum ‘misfortune = ma:lum ‘apple’ 
liber ‘book li:ber ‘free’ 
lego Tread’ le:go ‘I appoint as delegate’ 
furor ‘fury’ fu:ror ‘T steal’ 


Main stress was realized on the penultimate syllable if that syllable was heavy, 
otherwise main stress was realized on the antepenultimate syllable. Some examples 
are given in (5), where l and h stand for light and heavy. 


(5) a. me b. ldc 

‘honey’ ‘milk 

c. má lum d. má. lum 
‘misfortune ‘apple’ 

e a mÉ cum f. cd me ram 
1h 1 l 
‘friend’ ‘(bed)room’ 

g. dr bo rem h. pe dés ter 
pi l h 
‘tree’ ‘on foot’ 


In (5a) and (5b) the main stress is on the only available syllable in the word. In the 
bisyllabic words, (5c) and (5d), stress is on the first syllable. The words in (5e-h) have 
three syllables. In (5e) the prefinal syllable is heavy because the vowel is long, and the 
main stress is therefore on the prefinal syllable. In (5h) the prefinal syllable is heavy 
because it is closed. Again, the main stress is on the prefinal syllable. In both (5f) and 
(5g) the prefinal syllable is light, and therefore the stress is on the preprefinal syllable. 

The stress rules for Latin can be given in terms of the SPE formalism discussed in 
Chapter 6 as the three rules in (6), where V stands for a short and V either for a long or 
for a short vowel. 


(6) a V-—>[+stress] / __CyVC,VCo# 
b. V—[+stress] / _ CoV Co# 
c. V 一 [+stress] / _ Co# 


If applied in this order, rule (6a) is tried first, and only if its structural description is 
met ~ that is, if there is a word of more than two syllables with a prefinal light syllable — 
will it apply. It cannot apply if the word contains only one or two syllables, or if the 
word contains more than two syllables and the prefinal is heavy. In that case, rule (6b) 
is tried. This rule will apply in all remaining cases, unless the word has only one syl- 
lable. Rule (6c) will then provide monosyllabic words with main stress. The three rules 
in (6) can be collapsed as the single rule in (7), with the condition that the longest 
expansion of the rule is tried first, followed by the second longest, etc. 


(7) V—>[+stress] / _ Co((VCI)V Cy) # 
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Secondary stresses in longer words, such as liberatidnem ‘delivery, liberation’ can be 
accounted for by applying rule (8) as often as it can apply to a word. This has the effect that 
stresses are assigned to every syllable that is separated by one syllable from the following 
stressed syllable. This repeating action of a rule is called iterative rule application. 


(8) V—>[+tstress] / _ CyVC)V[+stress] Co (iterative) 


At the beginning of this chapter, we observed that a number of objections have been 
raised against this linear way of accounting for stress patterns. Here we note one more 
objection. The rules involved are purely descriptive. That is, although stress patterns 
that occur in natural languages can be described by rules similar to the ones in (6-8), 
there is no explanation for why the stress patterns are the way they are. The formalism 
allows in principle for all kinds of patterns that are never attested. Secondary stresses, 
for instance, typically alternate in a binary fashion. However, a rule assigning stress 
after every two, three, four or more syllables is not more or less marked than a rule like 
(8). This ‘binary’ aspect, which we might call the ‘counting-by-two’ aspect of stress 
rules, is something that is left unexplained in the linear framework. In the next section 
we will review the way in which metrical theory is more successful in providing 
a principled theory of stress patterns. To round off this section, let us illustrate 
the ‘counting-by-two’ aspect by giving the somewhat more complex stress system of 
Cairene Arabic (cf. McCarthy 1985, Hayes 1981, 1995, among others). In Cairene 
Arabic the main stress is on the final syllable, if that syllable is superheavy (ending in 
a long vowel plus one consonant, or a short vowel plus two consonants). If the final syl- 
lable is not superheavy, the main stress is realized on the penultimate syllable if heavy. 
If neither of these two conditions is met, main stress is realized on the penultimate or 
antepenultimate syllable depending on which of these two is separated by an even 
number of syllables from the rightmost, nonfinal heavy syllable in the word. If the 
word does not contain any heavy syllables at all, stress is realized on the syllable that is 
separated by an even number of syllables from the beginning of the word. Some 
examples are given in (9), where sh stands for superheavy. 


(9) a) sa ka kiin b) Sa mál ti 

I-A s WE 
‘knives’ ‘you (fsg) did’ 

c) mux ta li fa d) mar ta ba 
be Si KoT] 
‘different (ESG)’ ‘mattress’ 

e) bi xa la f) fa dja ra tu hu 
jn 下 
‘misers’ ‘his tree (NOM.) 


g) Ja dja ra tu hd maa 
linol 1 ateky ol h 
‘their (dual) tree (nom.)’ 


In (9a) the main stress is on the superheavy final syllable, and in (9b) it is on the 
prefinal heavy syllable. In (9c) and (9d) there is a nonfinal heavy syllable in the word 
and, being the only one, it is also the rightmost one. In (9c) main stress is on the 
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antepenultimate syllable rather than on the penultimate syllable, because the syllable 
[ta] and not [li] is separated by an even (zero) number of syllables from the rightmost 
nonfinal heavy syllable [mux]. In (9d) main stress is on the penultimate syllable and 
not on the antepenultimate one, because the syllable [ta] is separated by an even num- 
ber of syllables from the rightmost nonfinal heavy syllable, [mar]. In (9e-g), there are 
no nonfinal heavy syllables, so the main stress is determined by counting by two from 
the left edge of the word. 


Q109 in Passamaquoddy (Stowell 1979, Dell and Vergnaud 1984, Hayes 1995), syllables 
with full vowels count as heavy and those with schwa as light. Formalize the stress 
rule in your own words. 


mekanutasdpanik ‘those who must have been chosen’ 


akanutamakan ‘story’ 
luhkewinawak ‘workers’ 
elakilakapan ‘he was big’ 
wamasanamanal ‘he gets them’ 


13.4 STRESS AS RELATIVE PROMINENCE: NONLINEAR PHONOLOGY 


13.4.1 A parametric theory of relative prominence 


Hayes (1981), elaborating on Liberman and Prince (1977) and Halle and Vergnaud 
(1978), proposed a theory of stress that is an improvement upon the linear account 
in two respects. First of all, stress is no longer represented by means of a feature 
[stress], but is essentially considered to be a strength relation between syllables. 
That is, stress is formally represented by using binary branching tree structures in 
which one node is dominant and the other recessive. The dominant node is stronger 
than its recessive sister node, and accordingly the nodes of such binary branching 
trees are labelled S(trong) and W(eak). Second, Hayes proposes to account for the dif- 
ferent stress patterns in natural languages by a number of parameters. Each param- 
eter represents two choices between which languages will choose. Hayes (1981) 
assumes the four parameters in (10). 


(10) Right-dominant vs left-dominant 
Bounded vs unbounded 
Left to right vs right to left 
Quantity-sensitive vs quantity-insensitive 


We will briefly explain each of these parameters. The first parameter allows languages 
to vary in whether it is the right node or the left node in a binary branching tree structure 
that is dominant: in any one language, nodes are either labelled (WS) right-dominant 
or (SW) left-dominant. j 

The second parameter is necessary to describe the difference between bounded and 
unbounded languages. In bounded languages, the main stress is located at a fixed 
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distance from the boundary of the word. In the unmarked case, main stress may be 
separated by one syllable from the right or left word boundary. In addition, secondary 
stresses are located at fixed intervals from other stresses. As we observed in section 13.3, 
secondary stresses are typically separated by one unstressed syllable. In unbounded 
stress systems, stresses cannot be located at a fixed distance in this way. Proto-Indo- 
European, for instance, had the following system. The leftmost heavy syllable, which 
might well be the last syllable in a long word, received the main stress. If a word did not 
contain a heavy syllable, the main stress was located on the first syllable (cf. Halle and 
Vergnaud, 1987). Tetouan Arabic (van Hoorn 1996) is the mirror image. It has the main 
stress on the rightmost heavy syllable (which may well be the first syllable in a long 
word) or, if the word does not contain heavy syllables, on the penultimate syllable. 

The primary stress in unbounded languages is thus pulled towards heavy syllables 
quite regardless of the distance that separates the heavy syllable from the edge of the 
word. In both Proto-Indo-European and Tetouan Arabic, [tat.ta.ta.ta.ta] will have the 
primary stress on the first and [ta.ta.ta.ta.ta:] on the last syllable, and only when 
there is no heavy syllable, as in [ta.ta.ta.ta.ta], will the difference between the two lan- 
guages come out. Bounded languages, by contrast, will not allow the primary stress to 
stray too far from the word’s edge, and will place it on a light syllable if no heavy syllable 
is available. In Hayes’ (1981) theory, this resolves as a difference in allowable foot size: 
bounded languages have maximally binary feet (cf. (11a)), while unbounded languages 
put no upper limit on the size of a foot. Technically, the distinction is derived by allow- 
ing unbounded languages to have S-nodes higher up in the foot tree (non-terminal S- 
nodes). In (11b,c), the nodes indicated by a dot can thus be labelled S, potentially 
allowing feet to grow without limit. A prediction of this account is that unbounded 
languages do not have the typical binary rhythm of secondary stresses in longer words. 


(11) (a) o o (b) o o o o 
s w s w w w 


RS 


Bounded languages cannot have non-terminal S-nodes, and because, by stipulation, no 
language can have non-terminal (i.e. branching) W-nodes, the only way a four-syllable 
structure like (11c) can be parsed in a bounded language is as in (12). The two foot 
nodes will then be gathered up in a word tree to complete the prosodic structure of the 
word (see below). 


o (co o 
w s w 


(12) o CT o o 

S w S w 

IRL cs SS 
Languages can thus have unbounded stress systems if S nodes are not necessarily 
terminal nodes. In the remainder of this chapter we will focus on bounded stress 


systems. (See Prince (1985) for a proposal to describe unbounded stress systems by 
using bounded feet.) 


Stress as relative prominence: nonlinear phonology 193 


Languages that have bounded stress systems may differ in whether the main stress 
(located at a fixed distance from the word boundary) occurs at the beginning or at the 
end of the word. To capture this difference, the third parameter in (10) is necessary, 
which allows foot construction to start either at the right edge or at the left edge of a 
word. A further difference among bounded stress systems, represented by the fourth 
parameter in (10), is related to the importance of the internal structure of the syllable 
when constructing feet. In one group of languages, the fact that a syllable is heavy or 
light does not influence the construction of feet. Languages of this kind are said to be 
quantity-insensitive. In another group of languages, the internal structure of the 
syllable has to be taken into account. In a quantity-sensitive language, a W node may 
not dominate a heavy syllable. The difference between quantity-sensitive and quantity- 
insensitive stress systems is derived by assuming that in the former case metrical trees 
are built on the rhyme projection, where the distinction between heavy and light 
syllables can be formalized as a distinction between bimoraic versus monomoraic 
rhymes. In quantity-insensitive languages, trees are built on the syllable projection, 
where the difference between heavy and light syllables is not visible. If we disregard the 
direction parameter in (10), the third one, the following four basic bounded tree types 
are possible: quantity-insensitive, left-dominant (QI-ld), quantity-insensitive, right- 
dominant (QI-rd), quantity-sensitive, left-dominant (QS-ld), and quantity-sensitive, 
right-dominant (QS-rd). In the next section we will discuss each of these possibilities. 


Q110 To what kind of input must a child be exposed in order to determine that stress in 
his or her language is 


1. sensitive to quantity? 
2. unbounded? 


13.4.2 Four types of bounded stress system 


We discuss the four types of bounded stress system identified above in the order 
QI-ld, QI-rd, QS-ld and QS-rd. 

Garawa is an example of a QI-ld stress system. Some data are given in (13) (see 
e.g. Furby 1974, Hayes 1981). 


(13) ja.mi pin.ja.la  wacim.pa.nu 
‘eye ‘white’ ‘armpit’ 
ka.ma.la.rini já.ka.là.ka.làm.pa 
‘wrist  、 ‘loose’ 
nan.ki.ri.ki.cim.pa. ji 
‘fought with boomerangs’ 
Main stress is always on the initial syllable and secondary stress iterates in a binary 


fashion, i.e. occurs on every other syllable, from right to left. The rules in (14a) will 
produce the metrical structures in (14b). i 


(14a) Construct a QI-ld foot at the left-edge of the word 
Assign QI-ld feet from right to left 
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1 


(14b) ja mi pin ja la wa cim pa pu 
S W S w S w S w 
ka ma la rin gi ja ka la ka lam pa 
S Ww 5 W s w s w S W 


pan i. aft oy Moy tm. pà -J 
oo 

Notice that the third syllable in words with an odd number of syllables cannot be 
grouped with another syllable in a foot, and as a result, monosyllabic feet are con- 
structed. In Hayes’ (1981) theory, a monosyllabic foot is labelled neither W nor S, but 
simply interpreted as a stressed foot. Languages vary in their treatment of monosyl- 
labic feet; sometimes they remain as such, and sometimes they are destressed (i.e., the 
foot is erased and the syllable it dominates attached to a preceding or following foot 
depending on whether the feet are left- or right-dominant). This is what must be 
assumed for Garawa also, given that the third syllable in these words does not have a 
secondary stress. The issue of monosyllabic feet will be taken up again in Chapter 14. 

The rules in (14a) describe which syllables are stressed, but do not differentiate 
between the syllable with the main stress and the secondary stressed syllables. In 
Hayes (1981) this is accounted for by an extra layer of tree structure above the level 
of the feet, which is called the word tree. If, for Garawa, we group the feet into a left- 
dominant, unbounded word tree, the first syllable will be the only one dominated by a 
strong node and will therefore have main stress, as illustrated in (15) for some of the 
forms in (14). Henceforth, as in the metrical representations in (15), the feet are sep- 
arated from the word tree by a horizontal line for reasons of clarity. As shown for 
[kamalaringi ] in (15), the destressed left-over syllable is attached to the foot on its left 
rather than to the foot on its right, given that feet in Garawa are left-dominant. 


(15) já mi pin ja la wa cim pa pu 
Sz W AZ W W S WwW Ay W 
s s w 
ka ma la rin ji ja ka lh ka lim pa 
S W W W S Ww w sS w 
$ 
s w s w w 
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The stress pattern of Weri (see e.g. Boxwell and Boxwell 1966, Hayes 1981) can be 
described by assigning QI-rd feet from right to left followed by the construction of a 
right-dominant unbounded word tree, as illustrated in (16). In Weri, monosyllabic 
feet are not subject to destressing. 


(16) kú li pd ‘hairofarm lù a mit ‘mist’ 
s 


le 
K 
i 


Classical Latin may serve to illustrate the QS-ld foot. In Latin polysyllabic words, as 
mentioned above, stress was never realized on the final syllable, but on either the 
penultimate or antepenultimate syllable. If stress is on the antepenultimate syllable, 
we may seem to need a ternary foot at the right edge of the word. However, in order to 
maintain the basic bounded foot types (maximally two syllables in a foot), Hayes 
(1981) uses the concept of extrametricality. At the periphery of a word - that is, at 
the right or left edge - a phonological constituent (syllable, segment, consonant, 
vowel, rhyme, mora, etc.) may be declared extrametrical, that is, be made invisible to 
the metrical tree construction rules. For Latin, a rule marking the final syllable extra- 
metrical prior to QS-ld foot construction is needed, as illustrated in (17). In (17) we 
have indicated extrametrical syllables by angled brackets. Furthermore, in (17) we 
have only built one QS-Id foot, that is, QS-ld feet are not constructed iteratively in 
Latin. The main reason for this is that secondary stresses in Latin alternate in a binary 
fashion regardless of syllable weight. In a word like voluptatem ‘voluptuousness’, sec- 
ondary stress is on the initial syllable, although the second syllable is heavy. The same 
holds for a longer word like liberatidnem ‘delivery’. This is accounted for by iterative 
assignment of QI-ld feet after the construction of a single QS-Id foot. Finally, a right- 
dominant, unbounded word tree is needed to promote the final foot to main stress. 


(17) a mí <cum> cá me <ram> 
l h w l l w 
s w 
| 
w s 
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y p nee pe dés <ter> 
i 
Ea LI 


vò lup tá <tem> 
1 hh w 


Three remarks on the derivations in (17) are in order. First, the initial monosyllabic 
feet in amicum and pedester wrongly predict that in those words there is secondary 
stress on the syllable preceding the main stress. Again, a destressing rule (not indi- 
cated in (17)) is necessary. Second, monosyllabic words like mel and lac (cf. (1) above) 
are stressed. The rule marking final syllables as extrametrical must therefore be 
blocked if it would render the entire stress domain extrametrical. Third, extrametrical 
syllables are adjoined as W nodes to the word tree. 

The final foot type to consider is the QS-rd foot. In Hixkaryana (see Derbyshire 
1979, Hayes 1995) a weight distinction is made between closed and open syllables — 
that is, in the underlying representations no long vowels occur. As in Latin, the 
final syllable in polysyllabic words is never stressed, and we thus need a rule of final 
syllable extrametricality. The construction of QS-rd feet from left to right and a right- 
dominant unbounded word tree will produce the correct metrical structures as shown 
in (18). Main stress is slightly simplified here: see Hayes (1995: 206) for more discussion. 


(18) ow to hd <na> th ku rié ho <na> 


h | l |i MAD l 

| Ry W | w Ss WwW ow 
w s w s 

Bei ee 


‘to the village’ ‘to Tohkurye’ 
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kha na nih <no> mi ha na nih <no> 
l h w l l i h w 


w S W S 
w S 


S 


‘Ttaughtyou . ‘You taught him’ 


Given that the prefinal light syllable in [tohkuriéhona ] is not stressed, a rule destress- 
ing the prefinal light monosyllabic foot is necessary. After destressing, this syllable is 
adjoined as a weak member to the word tree, as illustrated in (18). It cannot be 
adjoined to the preceding foot, because in Hixkaryana the feet are right-dominant. 
We will return to this in the next chapter. 


Q111 icelandic (Árnason 1985) exhibits the following stress pattern in noncompound 
words. (Because the acute accent is also used as an orthographic diacritic in 

Icelandic, we indicate main stress with the help of a stress mark.) 
‘Jon ‘John’ 
'tas.ka ‘briefcase’ 

'höf.ðing.jà ‘chieftain (gen. pl.)’ 
'a.kva.rèl.la ‘aquarelle’ 
‘bi.o.gra.fi.a ‘biography’ 


Propose an analysis. 


Q112 Here are some examples of stress in Hopi (Jeanne 1978, Hayes 1981). 
lés.ta.vi ‘roof beam’ 

ko.j6.no ‘turkey’ 

me.l6:.ni ‘melon’ 


pa:.wi.k’a ‘duck’ 
ca.qap.ta ‘disk’ 


Formalize the stress rules. 


13.5 _ CONCLUSION 


In this chapter we have discussed the linear and nonlinear representation of stress. We 
have illustrated the four basic foot types for bounded stress systems with examples 
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from Garawa, Weri, Classical Latin and Hixkaryana. The role of destressing rules has 
been discussed, and the concept of extrametricality has been motivated and illus- 
trated for Latin and Hixkaryana. Furthermore, we have seen that the distinction 
between main stress and secondary stress is accounted for by a separate layer of met- 
rical tree structure: the word tree. If we now compare the metrical theory advanced by 
Hayes (1981) to the previous linear account, then it is obvious that the objections 
raised above no longer hold. The ‘counting-by-two’ aspect is a direct result of the tree 
structures allowed, and there is no longer a feature [stress]. The metrical trees express 
in a quite natural way that stress is a relative property of syllables in a word. However, 
it still is not clear why the trees are the way they are. That is, although a much more 
principled way of accounting for the possible different stress patterns in natural lan- 
guages is available, it is still not clear why the patterns we find are in a way natural. 
This issue will be taken up in the next chapter. 


Further constraining stress 


14.1 INTRODUCTION 


This chapter discusses Hayes’ (1995) revised metrical theory on the basis of the stress 
patterns of the languages discussed in Chapter 13, and illustrates how stress can be 
accounted for in Optimality Theory. Section 14.2 discusses the three basic metrical 
units (the syllabic trochee, the iamb and the moraic trochee) of Hayes’ (1995) stress 
theory. The notion of catalexis is introduced, and it is shown that compared with the 
analyses in the previous chapter destressing rules are no longer needed. After that, in 
section 14.3, foot-based segmental rules are briefly discussed. Section 14.4 addresses 
the relation between stress and morphology. Finally, section 14.5 shows how stress is 
accounted for in Optimality Theory. 


14.2 IAMBIC AND TROCHAIC RHYTHM 


In Hayes (1985, 1987) it is observed on the basis of typological research that of the four 
basic foot types discussed above, the QI-rd foot is quite rare. That is, the majority of QI 
languages use the QI-ld foot. Moreover, among QS languages, the most frequent foot 
type is QS-rd. In addition, in languages that do need the QS-ld foot, it never appears to 
be constructed iteratively. Hayes (1995) connects these facts with the results of percep- 
tion experiments which show that listeners group sequences of even duration, like ta- 
ta-ta-ta-ta-ta ...,in a binary fashion with initial prominence, that is, (táta) (tdta)(tdta), 
but group sequences of stimuli of unequal duration, like ta-taa-ta-taa-ta-taa ...,ina 
binary fashion with final prominence, that is, as (tatda) (tatda) (tatda), etc. Hayes 
terms the former grouping ‘trochaic and the latter ‘iambic , and accordingly defines two 
fundamental laws of alternating rhythm: trochaic rhythm (even duration, initial 
prominence) and iambic rhythm (uneven duration, final prominence). He then pro- 
poses the foot inventory in (1), where bracketed grids are used instead of labelled trees: 
a bracketed grid (x .) is equivalent to a foot labelled SW, while (. x) is equivalent 
to a foot labelled WS. Also, in (1) the mora is used as a unit of syllable weight: a light 
syllable has one and a heavy syllable two moras (Chapter 10). 
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(1) (x .) 

a. SYLLABIC TROCHEE: oo 
(where ø is either 1 or h) 
(x .) 

b. MORAIC TROCHEE: bp 
(where uu is either Il or h) 
(- x) (x) 

C. TAMB: bo oor h 


The foot types (1a) and (1c) do more or less the same descriptive work as the former 
QI-ld foot and the former QS-rd foot, respectively. The moraic trochee (1b) is 
designed, among other things, to account for stress systems that in Hayes (1981) were 
described with the QS-ld foot, although it is not equivalent to the QS-ld foot. As 
already mentioned in Chapter 13, metrical theory consists of a number of alternative 
proposals. Different foot inventories have been proposed by, among others, Jacobs 
(1990), Dresher and Lahiri (1991) and Kager (1993). Also, a different way of deriving 
metrical constituent structure has been proposed by Halle and Idsardi (1995). 

Let us briefly consider how the languages discussed in Chapter 13 can be described 
with the Hayes (1995) foot inventory given in (1) above. The Garawa stress facts follow 
straightforwardly if the rules are as follows: assign a syllabic trochee at the left edge 
followed by right-to-left assignment of syllabic trochees, as illustrated in (2). The 
word-tree construction rules discussed in Chapter 13 have been replaced in Hayes 
(1995) by a general rule, called the END RULE, which promotes either the first (END RULE 
INITIAL) Or the last (END RULE FINAL) dominant position marked by an x to main stress sta- 
tus by adding an extra x to it on a superior layer of metrical structure. For Garawa we 
need the END RULE INITIAL (2b) in order to derive initial main stress. Notice that com- 
pared with the analysis in (14) and (15) in Chapter 13 no destressing is needed, given 
that left-over syllables are not footed by (1a). 


(2) a. já mi b. pún ja la c. wa cimpa nu 
SYLLABIC TROCHEE ee") (2) ix AK 2) 
END RULE INITIAL (er (x ) (x ) 


d. kamalarinji e. já ka làka làmpa 
SYLLABIC TROCHEE (x a) wd (x!) (ee DAK 7) 
END RULE INITIAL (x ) (x ) 


f. nån ki ri ki rim paji 
SYLLABIC TROCHEE (x~ 2), 4&2 fe.) 
END RULE INITIAL (x ) 


Since no monosyllabic feet are allowed by the SYLIABIC TROCHEE, the question arises of 
what, in this new analysis, happens to languages like Weri, which do not destress 
monosyllabic feet. (A monosyllabic foot in systems with (1a) or a monomoraic foot in 
systems with either (1b) or (1c) is known as a degenerate foot.) For this former QI-rd 
system, we could use a syllabic trochee left to right to produce the alternating pattern, 
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but we are left with the problem that the final syllable of words with an odd number of 
syllables will not be stressed. Also, words with an even number of syllables will incor- 
rectly receive secondary stress on the second syllable. Right-to-left syllabic trochee 
construction will always predict main stress on the penultimate syllable, which again 
is incorrect. It seems that the final syllable should form its own foot. This could be 
done by applying the END RULE FINAL twice: before and after iterative right-to-left syllabic 
trochee construction as illustrated in (3). 


(3) a. kdliph b. ulù amit 
END RULE FINAL x) x) 
SYLLABIC TROCHEE (x .) (x .) 

END RULE FINAL C~ =) ( x) 
c. aku nè te pal 

END RULE FINAL x) 

SYLLABIC TROCHEE fA): 3) 

END RULE FINAL ( x) 


An alternative solution would be to appeal to the notion catalexis (Kiparsky 1991), the 
opposite of extrametricality. Catalexis is the addition at the edge of a word of a prosodi- 
cally relevant, though segmentally empty, constituent (syllable, mora or segment). A 
catalectic syllable is thus an abstract syllable at the end of a word, which is treated as a real 
syllable by the footing algorithm. This would mean that, for instance, [kulipu] and 
[uluamit] can be parsed as in (4) by right-to-left syllabic trochee assignment and the 
END RULE FINAL. In (4), catalectic syllables are indicated by square brackets. 


(4) a. kdli plilo] b. ulda mit[o] 
SYLLABIC TROCHEE Ea 4) (eee) 
END RULE FINAL ( x ) ( : 


The notion of catalexis has been proposed in order to completely eliminate degenerate 
feet, which Hayes (1995) still allows in very limited circumstances. We will not go into the 
implications of catalexis theory here, but refer for some discussion to Kager (1995) and 
Jacobs (1994). We conclude that former QI-ld straightforwardly translates into syllabic 
trochee assignment, whereas the former QI-rd stress systems require some reanalysis. 

Let us next look at Hixkaryana. Final syllable extrametricality and left-to-right 
iambic construction followed by the END RULE FINAL produces the correct results, as 
demonstrated in (5). 


(5) a. òw to hé<na> b. toh ku rého<na> 
ol | Pei ls J 
IAMB (x) (. x) (x) Gx) 
END RULE FINAL ( x ) ( x ) 
c. kha nà nih <no> d. mi hà na nih <no> 


L b bY Lily'h 
IAMB (. x) (x) (x) x) 
END RULE FINAL ( x ) ( x ) 
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Importantly, because the iamb construction rule does not allow a single light syllable 
to form a foot on its own, the prefinal light syllable in [tohkur!éhona] is skipped. This 
means that again, unlike the analysis presented in Chapter 13, no destressing rule is 
needed. 

Let us finally look at Latin, previously QS-ld. Final syllable extrametricality, the 
construction of a moraic trochee at the right edge, followed by iterative left-to-right 
construction of syllabic trochees and the END RULE FINAL, produces the desired metrical 
structures, as illustrated below. 


(6) a. drbo <rem> b. cá me <ram> c. pedés <ter> 

hl ‘a Lh 

MORAIC TROCHEE (x) G a (x) 

SYLLABIC TROCHEE 

END RULE FINAL (x ) (x ) CX ) 

e. vò lup tá <tem> e. libe ra tid <nem> 

lh h 过 汪汪 

MORAIC TROCHEE (x) (x) 

SYLLABIC TROCHEE (x 过 (x .) (x .) 

END RULE FINAL ( x ) ( x ) 


By giving an extralinguistic, functional motivation to the formal elements of the 
descriptive vocabulary of metrical theory, we now have a more straightforward 
answer to the question raised at the end of Chapter 13: why is it that the stress patterns 
we find in the languages of the world are the way they are? In what way are they nat- 
ural? The foot inventory in (1) is a direct translation of the basic ways in which people 
apparently group sequences of elements of even duration and sequences of elements 
of uneven duration. 

We now return to Cairene Arabic, which we gave as an example of a language that 
has a rather complicated pattern of stress distribution. Its stress pattern cannot be 
covered by the descriptive framework discussed in Chapter 13, and requires special 
treatment. Consider the stress pattern in the words famálti ‘you (sc, FEM) did’, 
martdba ‘mattress’ and búxala ‘misers’. If we treat the final syllable as extrametrical, 
the stress pattern of búxala suggests a left-dominant foot. However, using a left-dom- 
inant QS foot makes the wrong prediction for the stress in martdba. The first heavy 
and the second light syllable should form a QS-ld foot, incorrectly predicting stress on 
the first syllable: *mártaba. If we were to use a QS-rd foot, the stressing in {amdlti and 
martdba comes out right, but now buxala is incorrectly predicted to be *buxdla. 
Therefore, a special foot type was proposed, one in which a dominant node could be 
strong if and only if it was branching, that is, heavy. A rd-foot on Samál can therefore 
indeed be labelled WS, because the dominant node mál branches, whereas the rd-foot 
on buxa cannot labelled WS, given that its dominant node xu does not branch. In this situ- 
ation, paradoxically, the rd foot will be labelled SW. It appears that, in the new model, 
an elegant description of these facts is possible. Quite simply, final mora extrametri- 
cality (which is suspended if this would make the entire final syllable extrametrical), 
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left-to-right moraic troche construction, followed by the END RULE FINAL, Will derive the 
correct results. 


(7) a. sa ka kii <n>  b. Sa mål ti 
j a a ran. A 
MORAIC TROCHEE | (x) 
- END RULE FINAL ( x ) enc 
c. mux tá lifa d. mar taba 
heresy Be A | 
MORAIC TROCHEE (x) (x.) (x) (x .) 
END RULE FINAL | ) (ie Ly 
e. bixala f. fadga rátu hu 
blstl | Wa | 
MORAIC TROCHEE o) a ie 
END RULE FINAL (x ) ( X ) 
g. Jadsa ratu hima <a> 
i 
MORAIC TROCHEE te He. 2) 
END RULE FINAL ( x ) 


In this section, we have discussed the refinements that led to the revised metrical the- 
ory of Hayes (1995). Three basic foot types are assumed which receive extralinguistic 
motivation. In the next section we will discuss the foot as a domain for phonological 
rules, and briefly investigate the relation between stress and morphology. 


Q113 Turkish normally has stress on the final syllable. However, there is a class of words, 
including mainly loans and toponyms, where stress is never on the final syllable 
(Sezer 1983). Consider the examples below. 


da ‘screwdriver’ 
‘restaurant’ 
place name 


An ka place name 
h 

Gi ko ta ‘chocolate’ 
| | | l 


Determine whether this stress system is quantity-sensitive. Motivate your answer. 

How can you account for the fact that stress is never on the final syllable? 

The absence of secondary stresses suggests that foot assignment is not iterative. 
Determine whether a moraic trochee or an iamb is required to account for the stress facts. 
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In ftalian, stress can be located on the final, penultimate or antepenultimate 
syllable. However, if the penult is heavy, stress never falls on the antepenult. 
Accordingly, loans with a prefinal heavy syllable and with stress on the antepenult 
are adapted. For instance, German [hamburk] is taken over as as [amburgo] (cf. 
Sluyters 1990), which suggest a quantity-sensitive foot. Apparently, we must 
assume that Italian uses a moraic trochee. 


How would you account for the existence of cornice ‘corniche’ by the side of cálice 
‘chalice’, both of which have a light prefinal syllable? 

Unexpectedly, stress can also fall on a final, light syllable, as in cittá ‘city’ and colibri 
‘colibri’. How could you account for this? 


Q115 Hayes (1995: 262) analyses Turkish nonfinal stress by constructing moraic trochees 
from right to left. The final foot is marked as extrametrical if a clash, i.e. a sequence 
of two adjacent (x)s, would otherwise arise. This alternative is illustrated in (1) with 
the metrical representation of Washington. 


(1) Wa shing ton 
eh h 
(x) <(x)> 


Demonstrate that the two analyses are not in fact equivalent by considering their predictions of 
the stress patterns of items (2) and (3). 


(2) In di ya ń ` lis 


14.3 FOOT-BASED RULES 


14.3.1 Quantitative adjustments 


As we have seen in section 14.2, iambic rhythm is typically applied to sequences of 
elements of uneven duration, whereas trochaic grouping typically concerns even - 
duration. It may come as no surprise that languages with iambic rhythm tend to have 
segmental processes whose effect is to increase the durational unevenness. Consider 
again some of the Hixkaryana forms discussed in (5) above, repeated in (8). 


(8) a. òw to hb<na> b. mihà nanih <no> 
hl kte Eeh 
IAMB HK) (Sf M EA 


END RULE FINAL ( Kc) ( x ) 
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In (8), not all disyllabic feet are identical. For instance, in the last form the first foot is 
of even duration, whereas the second one is of uneven duration. Now, languages with 
iambic rhythm typically have rules that lengthen the vowel in the stressed syllable of 
the foot, rules that geminate the consonant after such a stressed vowel, causing the syl- 
lable to be closed, or rules that reduce the vowel in the unstressed syllable, thereby 
increasing the durational contrast within the foot. Hixkaryana has the first quantita- 
tive adjustment: a vowel in an open nonfinal syllable becomes lengthened if it occurs 
in the stressed position of a foot. This will change the forms in (8) into those in (9). 


(9) a. Ow to hó: <na> b. mi há: na nih <no> 
cue ch he f h 

IAMB W 芭 CoR Ge) 

END RULE FINAL ( X ) ( X ) 


Strikingly, according to Hayes (1995), languages with trochaic rhythm lack rules of the 
kind mentioned above. However, not all foot-based rules are quantitative adjustment 
rules. In the next section it is shown that the foot can figure in the structural descrip- 
tion of segmental phonological rules that are not directly related to optimizing the 
iambic rhythm. We illustrate the role of the foot in two lexical rules of Dutch. 


14.3.2 Foot-based segmental rules 


In this section we are concerned with the more general role of the foot in the struc- 
tural description of phonological rules. Two lexical rules of Dutch will illustrate this 
role, 9-INSERTION in diminutives and PRE-T LENGTHENING. 

In Chapter 7 we formulated the rule of 3-INSERTION as in (10). 


F . 
(10) 3-INSERTION Ø 一 3/ [—tense] ben +— talons 


According to (10), schwa is inserted if the noun stem ends in a lax vowel followed by a 
sonorant consonant, as illustrated in (11). However, the schwa does not always appear 
when these conditions are met. In particular, the words in (12) fail to undergo the rule. 


(11) man ‘manaca ‘man 
bom ‘bomaca ‘bomb’ 
‘vandalin ‘vanda linaca ‘walk’ 
‘ho:rizon ‘ho:ri,zonaca ‘horizon’ 

(12) = 'me:diam 'me:diəmpjə ‘medium 
‘be:zam 'be:zampj3 ‘broom 
‘parlin ‘parlinkja ‘eel’ 
‘albym ‘albympjo ‘album 
‘sersam ‘seisampja ‘sesame 


'sairon ‘satronkja ‘sarong’ 
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The words in (11) differ from those in (12) because their bases have final syllables with 
either primary or secondary stress. This suggests that the rule is prosodically condi- 
tioned. More specifically, in addition to the requirement that the stem-final syllable 
should have a rhyme that consists of a lax vowel and a sonorant consonant, it must be the 
head of a foot (Kooij 1982, Gussenhoven 1993). This version of the rule is shown in (13). 


(13) 3-INSERTION 
| 
Oo 


oo 


mh 
6—=3/ | 
+cons | —X]pim 
[ates 


For this analysis to go through, final post-stress syllables must be included as weak 
syllables in the foot, regardless of whether they have schwa (14a) or a vowel other than 
schwa (14b).! If the final syllable is preceded by an unstressed syllable, however, it 
forms a foot by itself (14c). The word ['me:diam ] ‘medium is a monomorphemic ter- 
nary foot. Such feet in Dutch always have an onsetless syllable in final position, while 
the preceding syllable has [i] or [y] (Kager 1985). 


(14) a. b. 


Ww @ 

| | 

F F 

de ZS 
o o o o 
| | ta 

be: zəm sai ron ho: ri zən m 


PRE-T LENGTHENING Causes tense vowels in Dutch to be long before [r]. The effect of the 
rule is particularly noticeable in the case of tense [i,y,u], which are short in other con- 
texts. Thus, [bit] ‘beetroot’, [brysk] ‘brusque’ and [zvul] ‘sultry’ have short vowels, 
while [bi:r] ‘beer’, [by:rt] ‘neighbourhood’ and [zvu:rt] ‘rind’ have long vowels. The 
domain of this rule is the foot. Examples are given in (15), in which the primary and 
secondary stresses have been indicated in accordance with the foot structure outlined 
above. Interestingly, the rule cannot be directly related to stress: it applies in primary 
stressed syllables in (15a,b), in secondary stressed syllables in (15d,e) and in an 
unstressed syllable in (15c) (cf. (14b)). 


! Some speakers will give the form [sa:ronatja] as the diminutive of ‘sarong’. There are very few words that have 
the required context of a post-stress, final syllable with a lax vowel and a sonorant consonant. Trommelen and 
Zonneveld (1989) assume that, at least at earlier stages of the derivation, Dutch foot structure is somewhat dif- 
ferent. In particular, all closed syllables are feet in their analysis (see also Gussenhoven 1993). 
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(15) a. vir ‘algae’ 
b. uttrotk ‘incense’ 
c. drtyir ‘Arthur’ 
d. Grlivi:r ‘Oliver’ 
e. pkramént ‘barrel organ’ 


If the rule requires the vowel and [r] to be in the same foot, rather than in the same 
word, one should expect a word like [pi'ra:t] ‘pirate’, in which the final foot [ra:t] con- 
tains [r] but not the preceding [i], not to undergo it. This is indeed the case, as shown 
in (16). 


(16) pirait *purrait ‘pirate’ 
hurá: *hurra: ‘hurray’ 
k4rdyr3j *kSrdyir3j ‘corduroys’ 
àtmirá:l *àtmi:rá:l ‘admiral’ 


Q116 English [n] obligatorily appears before dorsal consonants in the words in (1), but 
varies with [n] in the words in (2) (Kiparsky 1979). What is the domain of the 
obligatory occurrence of [n]? Motivate your answer. ‘ 


ink ink 

inkrament increment 
inkriss or Inkris increase (verb) 
和 kri:s or fnkris increase (noun) 


Q117 Dutch [h] is pronounced in the words in (1), but despite the spelling, no [h] is 
pronounced in the words in (2) and (3). What determines the occurrence of [h]? What 
conclusion can be drawn with respect to the prosodic structure of the word in (3)? 

[hút] ‘hat’ 
[a:.bra.ham] ‘Abraham’ 
{jo.hd.nas] ‘John’ 
[ho.tél] ‘hotel’ 


Niehe Ini al (proper name) 
aloha [a.ló;.a] ‘aloha’ 
tahu [tá:.u] ‘tofu’ 


marihuana [ma:.ri.u.a:na] 


14.4 STRESS AND MORPHOLOGY 


Up until now it has been tacitly assumed that the domain for the foot construction 
rules is the entire word. That is, feet were constructed only after the complete word had 
been formed. This is an oversimplification. In many languages, the normal stress pat- 
tern is interrupted by the morphological structure of words. In English, for instance, 
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stress in complex words does not follow the same pattern as stress in underived 
words, as is evident from a comparison of the complex word Japanése with the simplex 
word Sócrates. We will now briefly discuss one way in which the morphological struc- 
ture of words can be taken into account by the stress rules of a language. Consider the 
following Latin forms in (17). 


(17) ai td que ‘andso 
l l 
b. ple rd que ‘the majority’ 
] l 


人 vi rúm que ‘and the mar 
l h 
dre- ne gat  ‘deny-again-3sg-pres-indic’ 


in (17a,b), stress is on a light penultimate syllable. This is surprising, because in words 
of more than two syllables stress is always on the antepenult if the penult is light, as in 
(17d). The forms in (17) are all morphologically complex: the first three consist of a 
stem to which a clitic -que ‘and’ is attached, while the last form consists of a verb stem 
to which a prefix re- again is attached. Apparently, the stress rules have to take into 
account the morphological structure of words: a prefix like re- has no effect on the stress 
algorithm, while a clitic like -que does. A possible way to account for this in metrical the- 
ory is to divide the morphological operations (prefixation, suffixation, compounding, 
cliticization) into two blocks: a cyclic and a noncyclic block (Halle 1990, Halle and 
Kenstowicz 1991). The assumption is that after each morphological operation the stress 
rules are reapplied. The crucial difference between the morphological operations in the 
cyclic block and those in the noncyclic block is that in the cyclic block all previously 
assigned metrical structure is erased by the Stress Erasure CONVENTION every time the 
stress algorithm is applied again. In the noncyclic block, the Stress Erasure CONVENTION is 
not active, which means that previously assigned metrical structure is still there after 
the morphological operation. A reapplication of the stress rules can then only apply to 
free, i.e. not already metrified, elements. Morphological operations that do not disrupt 
the stress pattern of simple words are thus located in the cyclic block, while morpho- 
logical operations that do interrupt the normal stress pattern are located in the non- 
cyclic block. For the forms in (17) this means that prefixation takes place in the cyclic 
block, whereas encliticization takes place in the noncyclic block. In (18) we have illus- 
trated this by contrasting the prefix re- with the enclitic -que. 


(18) Cyclic block 


ne gat i ta 

1 h | 
EXTRAMETRICALITY <g> <0 > 
MORAIC TROCHEE (x) (x) 
STRESS ERASURE CONVENTION ne gat i ta 
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PREFIXATION re ne gat i ta 
Lulil h l l 
re ne gat i ta 
17i] h l l 
EXTRAMETRICALITY <o> <o> 
MORAIC TROCHEE (rd (x) 
Noncyclic block 
re ne gat i ta 
[eet h l l 
na) (x) 
ENCLITICIZATION re ne gat i ta que 
ba | h l l h 
EXTRAMETRICALITY Ga.) <o> (x) <g> 
re ne gat i ta que 
H~] h l l h 
MORAIC TROCHEE a j| <o> + (x) no 
END RULE FINAL (x ) ( Xx ) 


The application of the stress rules in the noncyclic block can, in the case of itaque, only 
apply in the penultimate syllable, given that the previously assigned metrical struc- 
ture (the foot assigned to the first syllable of ita in the cyclic block) must be respected. 
This leaves the penultimate, light syllable as the only free element. For a more detailed 
discussion, see Jacobs (1997). In the last section of this chapter, we will briefly discuss 
how stress patterns are accounted for in the non-derivational framework of 
Optimality Theory. 


14.5 STRESS AND OPTIMALITY THEORY 


As mentioned in Chapter 3, in Optimality Theory (OT) phonology is thought of as a 
universal set of constraints which are hierarchically ranked on a language-specific 
basis. The relation between input and output is accounted for by respectively generat- 
ing for each input all possible outputs and evaluating these outputs so as to select the 
optimal one. Importantly, constraints may be violated, depending on the ranking of 
other constraints. Let us clarify this with a concrete example. Tagalog has a prefix um- 
signalling ‘actor trigger’ on the verb to which it is attached (see Prince and Smolensky 
1993: 34-7, McCarthy and Prince 1993b: 119-21). If the prefix is attached to a vowel- 
initial verb, such as [aral] ‘to teach’, the expected result is [umaral]. Surprisingly, how- 
ever, if um- is attached to a consonant-initial verb, such as [gradwet] ‘to graduate’ 
or [sulat] ‘to write’, it behaves suddenly as an infix, producing [grumadwet] 
and [sumulat], respectively. OT offers a very elegant solution in terms of constraint 
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domination. Two constraints are invoked by Prince and Smolensky (1993: 36). One is 
a constraint that states that the prefix should be at the left edge of a word, which we 
will simply term Auran for the moment, and the other one is NoCopa, which states that 
a syllable must not have a coda. If the constraint ALIGN dominates NoCopa, the prefix 
will always show up as a prefix, much like, for instance, the English prefix re-. In order 
to obtain the Tagalog facts we need to assume the opposite ranking: NoCopa domi- 
nates ALIGN. In (19) this is illustrated for an input /um- Livet in a tableau. Recall 
from Chapter 3 that rs indicates the optimal candidate, that * indicates a constraint 
violation and that ! indicates that a constraint violation is fatal. 


er ee Ce a 


In (19) candidates a and b violate the higher ranked constraint NoCopa three times, 
and are therefore less optimal than the remaining three outputs, which violate NoCopa 
only twice. Output c is optimal because it has the prefix um- as close as possible to the 
left edge. In (20) we have listed some possible outputs for an input /um-aral/. 


(19) 


c. a.ra.lum 
t= d. u.ma.ral 


In the final section of this chapter we will illustrate how stress distribution can be 
accounted for in OT. 


14.5.1 Constraining stress 


To characterize right and left dominance in feet, the constraints RuType(I) and 
RuType(T) given in (21) and (22) are assumed. 
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(21) RuType(1): Feet must be iambic. 


(22) RhType(T): Feet must be trochaic. 


In order to derive iambic grouping rather than trochaic grouping, the constraint 
RHTYpE(D) must outrank the constraint RuTyre(T). If the language is trochaic, the 
opposite ranking is required. In order to ensure binary grouping, the constraint in 
(23) FrBin is assumed, and to achieve the effect of extrametricality, the constraint 
NoNFINALITY is needed: 


(23)  FrBun: Feet must be binary (either two moras or two syllables). 


(24) | NonFinauity: A foot may not be final. 


McCarthy and Prince (1993a, 1993b) capture the left-to-right or right-to-left parsing 
of feet by appealing to their theory of generalized alignment. This theory amounts 
to a family of alignment constraints, which can be used for a variety of purposes. For 
instance, as we have seen above and in Chapter 3, the position of an affix is determined 
by a constraint which aligns one of its edges with the corresponding edge of the word. 
Most commonly, an alignment constraint stipulates that the right or left edge (R/L) of 
a grammatical category (GCat, among which are the morphological categories Root, 
Stem, Morphological Word, Prefix, Suffix, etc.) must coincide with the right or left edge 
(R/L) of a prosodic category (PCat: w, ©, foot, phonological word (or œw), phonological 
phrase, etc.; see Chapter 15) according to the general schema in (25). 


(25) General schema for ALIGN 
Align (GCat, R/L-Edge, PCat, R/L-Edge): the R/L-Edge of some GCat must 
coincide with the R/L-Edge of some PCat. 


To ensure that footing begins at the edge of a stress domain, the constraints in (26) 
and (27) are necessary. 


(26) ”AUGN (w, R, Ft, R): the right edge of the phonological word is aligned with the 
right edge of a foot. 


(27) ALiGN (w, L, Ft, L): the left edge of the phonological word is aligned with 
the left edge of a foot. 


In a word of three syllables, (oo )o will be more optimal if the ranking (27) » (26) 
obtains, while o (o0) will be more optimal if (26) » (27). Next, in order to derive itera- 
tive footing, the constraints in (28) and (29) are assumed. 


(28) Auen (Ft, R, w, R): the right edge of every foot is aligned with the right 
edge of the phonological word. 
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(29)  Auicn (Ft, L, œ, L): the left edge of every foot is aligned with the left edge of the 
phonological word. 


Recall that in OT constraints may be violated, depending on the ranking of other con- 
straints. The constraints Aticn(Ft) in (28) and (29) above are in competition with the 
constraint Parse-o in (30). 


(30) — Parse-o: Parse syllables into feet. 


If this constraint is higher ranked than (i.e. dominates) the Aticn(Ft) constraint (29), 
for instance, iterative footing results. The footing (ca)(aa) for a word of four sylla- 
bles will be more optimal than (ow)oo. The latter output violates the higher-ranked 
Parse-o constraint, while the former does not. 

Let us illustrate these constraints by deriving the stress pattern of Garawa (cf. 
Kenstowicz 1994b: 16). Because in Garawa no monosyllabic feet occur and because 
the grouping is trochaic, both the constraints FrBin and RuTyre(T) must be highly 
ranked. For Garawa, the alignment constraints in (31) and (32) are necessary. 


(31)  Aticn-w: Align (w, L, Ft, L) (= (27)) 
(32)  Aticn-Ft: Align (Ft, R, œw, R) (= (28)) 
In (33) we illustrate the interaction between these two constraints in an odd-syllabled 


word like [kamalarinyi]. As shown in (33), the actual ranking we need is the one in 
which (31) dominates (32). 


us a. (kama)la(cingi) 


(33) a. 


b. ka(méla)(rinji) 


kamalarinji Al (Ft, R, w, R) Al (@, L, Ft, L) 


a. (káma)la( (ringi) ooo#! 


ar b. ka(méla)(cingi) [oot 


Notice that for Garawa the constraint FTBIN must crucially dominate the constraint 
Parse-o. If not, the left-over syllable in (33) would be parsed as a foot. Finally, we illus- 
trate the interaction between the constraint Parsg-o and ALIGN(Ft) in [jakalakalampa] 
in (34). Parse-o must dominate the Auicn(Ft) constraint in order to derive iterative 
footing, as shown in (34a). 
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jakalakalampa Al (œw, L, Ft, L) | Parse-o Al (Ft, R, œw, R) 


jakalakalampa Al (w, L, Ft, L) | Al(Ft,R,@,R) | PARSE-G 


e b. (jáka)laka(lámpa) 


Q118 The ranking we have established thus far for Garawa is the following: 
FTBIN » RHTYPE (T) » AL (w, L, Fr, L) » PARSE-o » AL (FT, R, œ, R). 


Draw a constraint tableau for an input /jankirikirimpaji/ and show what happens to the 
following six possible output forms: 


[nianki) (ríki) (rimpa)ji] 


[an) (Kiri) (kirim) (páji)] 
[(nánki ) sikirim (paji)] 

[(nánki ) ri (kirim) (páji)] 
[(nanki ) (rfki) rim (páji)] 
[(ganki) ri (kirim) (páji)]. 
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(36) Auen (Heap, L, w, L) (H/L): The foot with main stress is aligned with the left edge 
of the phonological word. 


For the Garawa stress pattern we need to assume that ALIGN (HEAD, L, w, L) dominates 
ALIGN (HEAD, R, w, R). 

Let us finally consider Hixkaryana stress in the OT perspective. Given that iambs are 
required, RHTYpE(IT) must be higher ranked than RuTyre(T). Moreover, given that stress 
is never final, the constraint NONFINALITY must be highly ranked. Furthermore, iambs 
are assigned from left to right (to use derivational terminology), so Auicn(Ft,L) domi- 
nates Auicn(Ft,R). As footing is exhaustive, the constraint Parse-o must dominate the 
Alignment constraint. Also, no degenerate feet are permitted, which points to the rank- 
ing of FTBIN above Parse-o. This gives us the hierarchy in (37). 


(37) | RuType(1) » FrBin ，NONFINALITY » Parse-o » ALIGN (Ft, L, w, L). 


In (38) we consider some of the possible candidates for an input [tohkuriehonal. 


(38) 


| tohkurehma | RaT(l [FrB | | FBIs | Nove Parse-@ | ALIGN 


“eae Cessna Pe 


b. toh(kur’e)(héna) rs o#, 
ooo# 


c. (téh)(kur'é)(h6)na *] oi 
ooo# 
dt | | | [efor | 


In (38), we have not considered a possible candidate in which the first two syllables are 
grouped into a foot. If a candidate [(tohkú)(r'ehó)na] were added to tableau (38), it 
would wrongly be characterized as the optimal form. It is obvious that the first foot is 
not a proper quantity-sensitive iamb. In order to ensure the quantity-sensitivity of 
stress in languages like Garawa, two constraints have been proposed, the Weight-to- 
Stress Principle (WSP), demanding that heavy syllables are stressed and Weight-by- 
Position (WBP), discussed in section 10.5. They are here listed as (39) and (40), 
respectively. In tableau (41), which illustrates the final ranking for Hixkaryana, they 
have been collapsed to a single constraint QS. 


(39)  Weight-to-Stress Principle (WSP): Heavy syllables are stressed. 


(40) Weight-by-Position (WBP): A coda consonant is moraic. 
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要 
一 FT e eee 
rr a 


Q119 Latin main stress is quantity-sensitive, as shown by the forms in (a). In some 
varieties of Macedonian (cf. Baerman 1999) stress is fixed on the antepenuitimate 
syllable, as shown by the forms in (b). 


(a) Latin quantity-sensitive stress (b) Macedonian quantity-insensitive stress 
ca-me-ram ‘(bed)room’ d6-ve-dam ‘bring-1SG-PRES’ 
so-li-cu-lum ‘little sun’ do-vé-de-te ‘bring-2PL-PRES’ 
pe-dés-ter ‘on foot’ do-vé-dox-te ‘bring- 2PL-AORIST’ 


Determine what the ranking of the constraints At (w, L, Fr, L), AL (w, R ,FT R), NoNFINALITY, QS 
and ParsE-o for each language must be. 


Q120 Yurakaré (van Gijn (in prep.)), has a stress pattern exemplified by the following words: 


ta.ta ‘father’ 

ku.da.wa ‘lake’ 

ti.pé.ho.re ‘my canoe’ 

ma.la.no.lé.ju ‘don’t fall in love with him’ 


The following seven constraints are necessary to correctly characterize output forms: 
At (w, L), AL (w, R), NonFinaity, (H/L), (H/R), PARSE-o and FTBIN. 


Assuming output candidates that have iambic feet, propose an analysis and illustrate it with 
tableaux for [tata], [tipdhore] and (malanoléju]. 


Q121 Cavineña (Key 1968), a Quantity-insensitive language, displays the stress pattern 
illustrated in the following data: 


é.na ‘water’ 
ki-ri.ka ‘paper, book’ 
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à.si.ká.da ‘dirty’ 
ma.to.ha.i .wa ‘cactus-like plant’ 
a.ta.ta.wa.ha ‘kind of bee’ 


The following constraints will enable you to characterize the correct output forms: 


a. FTBIN 

b. AuiGn(Ft, L, œ, L) 
c. AuGN(Ft, R, œ, R) 
d. RHTYPE(T) 

e. RHTYPE() 

f. PARSE-Cr 


1. Identify the pairs of constraints that are crucially ranked. 
2. Demonstrate the interaction of each pair by giving two tableaux for some input form, one 
with the correct ranking and one with the incorrect ranking. 


Q122 In Creek (Haas 1977, Hayes 1995), main stress is realized as an H-tone. The 
following words are representative. 


ifa ‘dog’ 

i.f0.ci ‘puppy’ 
po.cds.wa ‘axe’ 
a.pa.ta.ka ‘pancake’ 
ak.to.pa ‘pridge’ 
al.pa.td.ci ‘baby alligator’ 
naf.ki.ti.ka:.ji.tá ‘hit-pl-obj’ 


Provide an OT account that characterizes the foot to which the H-tone will be associated. 


14.6 CONCLUSION 


We have discussed the refinements that led to the revised metrical theory of Hayes 
(1995). The syllabic trochee, the iamb and the moraic trochee have been used to describe 
the stress patterns of Garawa, Weri, Hixkaryana, Classical Latin and Cairene Arabic. The 
notion of catalexis was introduced and it has been shown that destressing rules are no 
longer needed. After that, we discussed the foot as a domain for segmental processes that 
have the effect of increasing the durational unevenness in languages with iambic rhythm, 
and as a domain for segmental rules that are not ~ or no longer — obviously related to opti- 
mizing rhythm. In addition, we considered the way in which the morphological structure 
of words can be taken into account by the stress rules of a language. Finally, it was demon- 
strated how stress distribution can be accounted for in Optimality Theory. Our final chap- 
ter considers phonological adjustments whose structural description somehow makes 
reference to more than one word, and deals with the theory of Prosodic Phonology. 


Phonology above the word 


15.1_INTRODUCTION 


The phonological structure of languages extends beyond the syllable and the foot. There 
is a hierarchy of phonological constituents, so that lower, and thus typically smaller con- 
stituents, are contained within higher, and thus typically larger ones. For instance, the 
foot is contained within, or dominated by, the phonological word, and phonological 
words are grouped into phonological phrases, etc. Instead of ‘phonological constituent 
the term ‘prosodic constituent is often used, particularly for the higher constituents, like 
the phonological phrase, the intonational phrase and the phonological utterance. The 
entire structure above the syllable is often referred to as the ‘Prosodic Hierarchy’. This 
line of research began with Selkirk (1972) and was consolidated by Nespor and Vogel 
(1986), Hayes (1989) and, for the phonological word, Booij (1985), among others. 

In this chapter, we discuss and illustrate the phonological utterance (U), the inton- 
ational phrase (IP), the phonological phrase (@) and the phonological word (w). In (1), 
an example of a sentence is given that has been parsed into these constituents. The next 
two lower constituents are the foot, which was discussed in chapters 13 and 14, and the 
syllable, which was discussed in section 10.6. Notice, for instance, that weak forms, 
the function words were and to, are not separate phonological words, but are included 
with the following form in the same w. In fact they aren't even feet, since they have no 
stress. Before discussing the prosodic constituents in more detail, we first deal with 
three general questions. First, how do prosodic constituents manifest themselves? 
Second, what is the general structure of the prosodic hierarchy? And third, what deter- 
mines the prosodic structure of specific sentences? 


(1) U phonological utterance 
ae 
IP IP intonational phrase 
| went 
中 中 中 Phonological phrase 
pT, | | 
w w w w phonological word 


meni pju:plz wəsləu tə raspond 


Many pupils were slow to respond 
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15.2 GENERALIZATIONS INVOLVING PROSODIC CONSTITUENTS 


Prosodic constituents manifest themselves in four ways: 


i 


Boundary strength. There is a greater degree of articulatory integration in 
lower constituents than in higher ones. For instance, within an English sylla- 
ble like [lamp] as in Lampton, the articulatory gestures occur relatively 
quickly after each other, but within higher-ranking constituents, the 
sequence of movements in [lamp] will be less tightly coordinated, as in A 
fulll amp]utation or A [lamb p]assed by. That is, prosodic breaks between 
higher constituents are stronger than those between lower constituents. 
Boundary strength will reveal itself in a number of phonetic measures. The 
extent to which the articulation of a vowel is influenced by that of a vowel or 
consonant in a preceding syllable will depend on the strength of the bound- 
ary between the trigger and the vowel concerned. Such coarticulation will 
cause a schwa to be closer in [3pi:] than in [apa:], in anticipation of the 
tongue position of the following vowel. However, this effect will be stronger if 
the two vowels occur within an w, as in [a]ppeal, than when they occur in 
different ws, as in Emm[a] Peel (cf. Cho 2004). Also, the duration of the last 
syllable of a prosodic constituent is typically longer as the lengthening is 
stronger (or ‘higher’, thinking hierarchically), which is known as final lengthe- 
ning. Equally, the initial segments of a constituent are more clearly pro- 
nounced as the boundary is stronger, which is known as initial 
strengthening (Keating, Cho and Hsu 2004). 


Boundary tones. Higher constituents are often characterized by intonational 
boundary tones. When that happens, the boundary is particularly easy to 
hear. Examples of this will be given when we discuss the phonological phrase 
and the intonational phrase. 


Postlexical phonological processes. Prosodic constituents frequently deter- 
mine the distribution of segments and the application of phonological 
processes. We have already seen in section 10.6 how the syllable functions in 
phonological generalizations, and in section 14.3.2, we saw the foot in the 
same role. The ways in which reference is made to phonological constituents 
have been classed into three types. They were given as ‘rules’ by Selkirk 
(1980), who had phonological processes in mind, but they might usefully be 
extended to any distributional facts. 


a. Domain limit constraints. Reference is made to the left or right edge of a 
constituent. In many languages, phonological words must end in a conson- 
ant, as in Tagalog for instance, although syllables occur freely without a 
coda word-internally. 

b. Domain span constraints. The context and the focus of some Ri 
tion must occur with a single constituent of some rank. For instance, as 
we will see below, Italian has a rule of s-voicinc, which causes [Z], never 
[s], to appear between vowels contained within a phonological word. 
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c. Domain juncture constraints. The context of the generalization may 
include the left and right edges of adjacent constituents of some rank, pro- 
vided this boundary occurs within some higher constituent. For instance, 
as we will see, [s] is optionally voiced to [z] in Dutch if it occurs finally in 
the phonological word and the next phonological word begins with a 
vowel, provided this boundary falls within an Intonational Phrase. 


4. Phonology-sensitive syntax. Lastly, it has been observed that syntactic and 
morphological rules may be sensitive to the size of constituents (Zec and 
Inkelas 1990). English has Heavy NP SHIFT, which allows the movement of the 
object in (2) to clause-final position. However, a condition on the rule is that 
the object NP must consist of more than one phonological phrase. For this 
reason, (3a) cannot move its NP. By contrast, (3b) shows that either order is 
fine if the object NP contains more than one phonological phrase. 


(2) Ta : 


(3) a. He gave the book to her aunt (* ... to her aunt the book) 
b. He gave to her aunt the book about Mozart (or: the book about 
Mozart to her aunt) 


15.3 THE STRICT LAYER HYPOTHESIS 


What does the prosodic hierarchy look like? A common view is that the constituents 
obey the STRICT LAYER HYPOTHESIS, which is perfectly obeyed by representation (4). 


(4) ( )U 
( )IP 
( )( )( )( ) 中 
( i )( )( )( )( Jo 


There have been a number of formulations of this principle (Selkirk 1984, Ladd 1996). 
We give it as (5). 


(5) Strict Layer HYPOTHESIS (SLH): A prosodic constituent of rank n is immedi- 
ately dominated by a single constituent of rank n + 1. 


An w, for example, will be dominated by 9. If it isn't, as in (6a), the structure is non- 
exhaustively parsed, in this case into constituents of rank 2. (In Chapter 14, we already 
saw that nonexhaustive parsing of syllables into feet in fact does sometimes occur; 
we come back to this point in section 15.9.) As a result, segment A in (6a) may begin 
a syllable without at the same time beginning a foot. If an w were to be dominated 
by two 中 s, we would create the incoherent occurrence of a p-boundary within an w. 
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This type of violation of the SLH, shown for the boundary between A and B in (6b) 
and known as ‘improper bracketing’, is ruled out completely. Third, if a constituent 
were to dominate a constituent of the same rank, we would have recursiveness. A 
structure is recursive if some constituent appears within a constituent of the same 
rank or a lower rank. In (6c), for instance, two constituents of rank 1 are contained 
within a constituent of the same rank. There have been various claims that same-rank 
recursiveness in prosodic phrasing must in fact sometimes be recognized. We will 
illustrate a case in section 15.7. 


(6) a ( Bp Xb j3 ell ae 
(aru )2 ( yo 2 ( )1 
LE FL ( XAB) )1 C PRATA 


As observed in Chapter 2, recursiveness in the morphosyntactic structure is one of 
the hallmarks of human language. It expiains why sentence length is infinite, as in 
This is the cat that caught the rat that stole the cheese that lay on the table that ..., where 
every NP except This has the structure [...[.-.]slx» as in [the cat [the cat stole the 
cheese];]yp in which the S has an NP which takes the form [...[...],],p» and so on, 
ad infinitum. Similarly, sentences may appear as premodifiers within NPs, as in an 
I-couldn't-care-less attitude. 


15.4 FACTORS DETERMINING PROSODIC PHRASING 


What determines where these prosodic constituents begin and end? Not surprisingly, 
an important factor is the morphosyntactic structure. It would, to give an extreme 
example, be unexpected for the main break in (7a) to occur between fo and arrive, 
with smaller breaks as indicated. Rather, we would expect something more like (7b). 
In fact, in (7b), the smaller domains are phonological phrases and the larger ones 
intonational phrases. As will be clear, they correspond with syntactic phrases, unlike 
the bracketed parts of (7a). By somehow marking off the meaningful constituents in 
the pronunciation, parsing of the expression will be easier for the listener. 


(7) a. {(The first) (train to)} {(arrive is the) (one from Paris)} 
b. {(The first train) (to arrive)} {(is the one) (from Paris)} 


However, a secondary role is played by constituent length. Since morphosyntactic 
constituents of a given rank may vary hugely in length, a one-to-one correspondence 
between phonological and morphosyntactic constituents would put unreasonable 
demands on speakers. For instance, the large NP after see in (8a) is syntactically 
equivalent to the word her in (8b). It would be quite a strain on the speaker to produce 
a phonological phrase that runs all the way from the old to road, while it would equally 
be awkward to produce two in quick succession for J can see and her. In (8a) there is 
too much phonological structure and in (8b) too little for a comfortably rhythmic 
occurrence of prosodic breaks. There would appear to be a tendency for constituents 
to consist of two lower constituents (Selkirk 2000). 
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(8) a. Ican see the old customs office at the end of the bend in the road 
b. I can see her 


Not surprisingly, a phonological phrase tends to be produced for each of the NP-internal 
phrases in (8a): (the old customs office), (at the end), (of the bend) and (in the road). 
And in (8b), the NP her will be incorporated with the preceding see into a single 
phonological word, pronounced [si:3], to rhyme with Maria. This incorporation of 
phonologically weak words into adjacent words is called cliticization, and her here is 
a clitic that attaches to the host see. Cliticization of phonologically weak words in 
phonological word formation is a specific case of what is called restructuring, the 
incorporation of phonologically light structures with adjacent words into a phono- 
logical constituent which it would otherwise have had to itself. We will see an example 
in section 15.5.4. Languages must be expected to have ways of preventing short 
phonological constituents. 

In (8a), the correspondence between the syntactic and the phonological structure 
is still in a sense one to one, since inside the large NP, the ‘maximal projection’, there 
are smaller phrases, PPs and an NP, or XPs for short, which correspond to phono- 
logical phrases. However, also when the syntax is identical, different prosodic struc- 
tures may be imposed on the grounds of length. For instance, an intonational phrase 
boundary is more likely after the subject NP in Hippopotamuses like to swim in the 
river than in John likes to swim in the river. Languages will vary in their preference for 
reflecting the morphosyntactic structure in the phonology at the expense of an even 
distribution of phonological constituents. 

A third factor is the information structure of the sentence. If someone answers 
the question When was Mozart born? by saying In January 1756, all of the information 
expressed is new to the hearer who posed the question. However, if the same expres- 
sion was said in response to Was Mozart born in January 1756 or in February 1756? 
only January would be the new information. These different focus constituents are 
indicated in (9a) and (9b), respectively. Languages have different ways of encoding 
such differences in information structure, or focus structure, and one of them is the 
prosodic phrasing. We will see an example of this in the discussion of the phonological 


phrase in Bengali. Other languages, like English, use pitch accents for this purpose 
(see Chapter 9). 


(9) a. In [January seventeen fifty-six] foc 
b. In [January];oc¢ seventeen fifty-six 


Alignment constraints will be able to take care of the coincidence of morpho- 
syntactic constituents and prosodic constituents. Selkirk’s (2000) ALIGNXP, for instance, 
given in (10), can be used to describe the phonological phrasing of the Basque sen- 
tence in (11a). It explains why (11b) is ungrammatical, because the right edge of the 
XP meaning ‘to Amaia’s grandmother’ does not end a 中 . The coincidence of prosodic 
constituents and focus constituents can likewise be accounted for by means of align- 
ment constraints. 


(10)  ALicnXP: Align the right edge of an XP with the right edge of a ọ. 
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(11) a. ,(Amaien amumari), ,(liburua), g(emon dotz0). 
Amaia-GEN grandmother-pat book give AUX 
She gave the book to Amaia’s grandmother 
b. *,(Amaien amumari liburua), ¿(emon dotzo)e 


An example of a constraint that considers phonological length is BINARY (Elordieta 
1997), a specific form of the general phenomenon that constituents mustnt be too 
short or too long. In fact, syntactically well-behaved (13b) is ungrammatical, because 
(12) outranks (10). 


(12) Binary: The first p of the sentence must contain minimally two ws. 


(13) a. (Amaiari amumen liburua), ,(emon dotzo), 
Amaia-DAT grandmother-GEN book give AUX 
She gave grandmother’s book to Amaia 
b. *,(Amaiari), ,(amumen liburua), ,(emon dotzo), 


Q123 if the syntactic structure of (13a) were to be used as a response to Did she give 
Joseba’s book to Amaia?, the focus constituent would be amumen, the ‘new’ 
information. The left edge of this kind of focus constituent must be aligned with a 
中 -boundary, as expressed in (1). 


(1) AucnFOC: Align the left edge of a FOC-constituent with the left edge of a o. 


The prosodic structure of the reply, in translation ‘She gave GRANDMOTHER’S book to Amaia’, 
is the one given as ungrammatical in (13b) of the text. 


1. How would you account for the fact that the first p consists of a single w? 
2. What would the prosodic structure be of the equivalent of ‘She gave grandmother’s 
BOOK to Amaia’? 


We now turn briefly to the four prosodic constituents. These are not the only prosodic 
constituents that are discussed in the literature. For instance, an ‘accentual phrase’ and 
an ‘intermediate phrase’ are often referred to. 


15.5 THE PROSODIC CONSTITUENTS 


15.5.1 The utterance 


Nespor and Vogel (1986) illustrate the domain span effect of the phonological utterance, 
or U, on r-LINKING in the standard variety of English spoken in England, RP. Like many 
other varieties, RP disallows the [一 cons] consonants [h,j,w,r] in the coda. Morphemes 
that end in nonhigh vowels ([9,19,€9,9:,a1,3:], as in villa, idea, fair, paw, car, stir) are fol- 
lowed by [r], if the next morpheme begins with an onsetless syllable, as illustrated in 
(14). (The [r] is traditionally known as ‘linking r when there is an [r] in the spelling, and 
as ‘intrusive linking r if there is not.) In (14a,b), r-LINKING is seen to apply within the 
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word and across words. However, the upper limit is the U: while it can apply across two 
sentences addressed to the same listener and not separated by a pause, it cannot apply 
across two sentences addressed to different listeners, even if they are spoken without an 
intervening pause. The examples (14c,d) illustrate that the U is not necessarily isomor- 
phic with a single syntactic sentence, but that there is nevertheless an upper limit to 
what can be accommodated within the same U. 


(14) a. sta: stir y(... sti[r]ing ...)y 
b. fea fair y(A fai[rlidea )y 
c. ‘filo Sheila y(Hi Sheila! [r]Everything all right?) y 
d. 'pi:ta Peter y(Hi Peter!) *[r] y(Open the window, Sheila), 


15.5.2 The intonational phrase 


The intonational phrase, or IP (often also I) tends to correspond to the root sentence, 
i.e. a single [NP VP]-structure without extrapositions or interruptions. Selkirk (1978) 
gives (15b), in which the extraposition in Pakistan and the restrictive relative clause 
which is a weekday have been assigned to separate IPs, leaving the root sentence, 
which would otherwise be a single IP (15b), to be divided over two IPs. However, as 
was the case with U, the division of speech into IPs is not purely syntactically driven. 
In particular, when the subject is longer than a single lexical word there will tend to be 
an IP boundary between the subject NP and the VP, as shown in (15c). 


(15) a. (In Pakistan), (Tuesday), ;(which is a weekday), (is a holiday), 
b. (Tuesday is a holiday), 
c. (The second Tuesday of every month), ;(is a holiday), 


The domain span effect of the IP can be illustrated with a rhythmic stress retraction 
rule affecting certain adverbials in Dutch, like [alteit] altijd‘always’, which can appear 
in a variety of sentential positions in the same IP. They have the word stress on the sec- 
ond syllable when no other pitch accented word follows in the same IP, as illustrated in 
(16a,b). The retraction of the stress to the first syllable occurs when the word is fol- 
lowed by a word with a pitch accent within the IP, as illustrated in (16c,d). (The pitch 
accent is indicated by {'].)! 


(16) a. (Naar de waterstanden luistert ze alti jd), 
to the water level reports listens she always 
‘The water leyel reports she will always listen to’ 
b. (Waar ze altijd naar luistert) , (zijn de waterstanden), 
c. (áltijd luistert ze naar naar de waterstanden), 
d. (Ze luistert áltijd naar de waterstanden), 


In many languages the IP is bounded by intonational boundary tones, a domain limit 
phenomenon from which the constituent derives its name. In English, nonfinal IPs are 
frequently closed by H; after a H*L pitch accent, which causes the final syllable of inci- 
dent in (17a) to have high pitch. The same H*L HI pattern may be used for questions, 


' This rhythmic phenomenon is distinct from the Dutch Rhythm Rule that applies to adjectives in NPs, the 
English equivalent of which is dealt with in 15.2.3 (Gussenhoven 1983). 
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as shown in (17b). IPs usually begin with an intitial Li. The second IP in (17a) has 
‘declarative’ intonation, and ends in Li. 


(17) a, pT OMEN apr 6 


a. 1(In préss reports of the incident); ¡(his name wasn’t méntioned), 
| | | | | | 
Ly H*L Hy fy H*L Li 
b. ¡(Have you read the préss reports of the incident?) 
| | | 
Í; H*L H 


15.5.3 The phonological phrase 


Although languages vary in the details of the correspondence, the phonological 
phrase, or ọ, tends to correspond to the syntactic phrase. Hayes (1989) shows that the 
中 defines the domain of the English RHYTHM RULE. In (18a), Chinese is an adjective 
inside the NP the Chinese dishes, while in (18b) the Chinese is an NP, the indirect object 
of gives. An adjustment of the stress pattern from Chinése to Chinese under the influence 
of the following main stress on dishes takes place in (18a) but not in (18b). Similarly, 
the German Rhythm Rule is sensitive to the German 中 in (19a), the headless object 
NP das hell-blaue forms a ¢ by itself, while in (19b), it forms part of the NP, and hence 
the ọ, das hell-blaue Bild. 


(18) a. On Tuesdays, he gives „(the Chinese dishes), 
b. On Tuesdays, he gives * (the Chinése), (dishes), 


(19) a. Ich fand ¿(das hell-bléue), e(schon), 
‘I found the light-blue one beautiful’ 
b. Ich fand (das héll-blaue Bild), 
‘I found the light-blue picture’ 


In Bengali, the @ is phonologically marked by a final boundary tone H, if it contains 
an intonational pitch accent L*, as shown in (20a). Because the right- -hand boundary 
of the ọ is sensitive to the focus of the sentence in Bengali, (20a) contrasts with (20b), 
whose focus is confined to the first constituent of the compound word for ‘fishhead’. 
(The focus is indicated by capital letters in the glosses. The examples also include 
an IP-final Li.) The Bengali @ equally defines the domain of the rule of r-DELETION 
(Q43) and a regressive voicing process (Hayes and Lahiri 1991). 


(20) 


a. tumi ate mat fher-matha) ọ ọ (ranna-korle), ); 
L* Hy Ly 
you which fishhead cooked 
‘Which FISHHEAD did you cook?’ 
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b. a. tumi ,(kon mat f her) so(math ay p(ranna-korle), Ji 


L* Ho Li 
‘The head of which FISH did you cook?’ 


Again, syntactic constituency does not provide the only relevant information for 
deriving p-structure. Constituents to the right of the NP head in English, such as the 
PP of ancient China in the NP the Chinése of dncient China, form their own 中 ,as shown 
by the absence of rhythmic stress shift on the word Chinése. However, when the post- 
posed PP does not contain a lexical head, like the PP on him in the NP that report on 
him, a single 中 is formed containing the whole NP. The relevance of this distinction 
was shown by Zec and Inkelas (1988), who pointed out that the syntactic rule of HEAVY 
NP SHIFT, which places an object NP in sentence-final position, only applies if the NP 
is composed of at least two gs. This is shown in (21), taken from Inkelas (1989). In the 
ungrammatical b-example, the object is only a single 中 . 


(21) a. Mary gave to Susan (that report), ẹ(0n Dukakis), 
b. *Mary gave to Susan (that report on him), 


The examples in (22a,b) (Nespor and Vogel 1986) illustrate another way in which non- 
syntactic information is relevant to p-formation in (American) English, as revealed 
by the behaviour of the RHYTHM RULE. In (22a), [iri:pra'du:s] réprodtice has the main 
stress on the last syllable, which pronunciation is as expected when it is the last word 
in the ~. However, in (22b) it has undergone stress shift, a pronunciation that requires 
that it should be followed by another accented word in the same ọ. This is explained 
by the assumption that if the Adverbial Phrase consists of a single phonological word, 
it can optionally be included in the ¢ on its left, instead of forming its own 中 . 


(22) a. Rabbits ,(réprodiice), (quickly and diligently), 
b. Rabbits ,(réprodiice quickly), 


Q124 What is the ¢ structure of sentences (1) and (2)? Why is (2) ungrammatical? 


(1) Iwas explaining to the students the problem of the double negation in Middle English 
(2) *l was explaining to the students the problem 


Q125 — The English rhythm rule can apply in (1), but not in (2). How would you account for 
this difference? 


(1) This is répresénted in six. (i.e. in (6)) 
(2) This is répresénted in six á. (i.e. in (6a)) 


226 Phonology above the word 


15.5.4 The phonological word 


Frequently, when a phenomenon is said to be word-based it is in fact confined to the 
domain of the phonological word, or w (also known as the prosodic word). 
Crucially, the w does not correspond in a one-to-one fashion to the morphological 
word. For instance, while compounds represent ‘words in the sense of morphological 
categories like Noun, Verb or Adjective, in many languages each of the constituent 
parts forms a phonological domain for (consonant or vowel) harmony, word stress 
and syllabification. Thus, VowEL HARMONY in Turkish is confined to the constituents of 
the compound, as shown in (23), from Nespor (1998), where the vowels in one con- 
stituent are all [+back] and the vowels in the other are other all [— back]. In German, 
the MOP (MAXIMUM ONSET PRINCIPLE, cf. Chapter 10) does not apply across the internal 
boundary in a compound, as shown in (24), while in Greek, each of the constituents 
has its own word stress, exactly as if they formed an NP, as shown in (25). 


(23) a. jemek odasw ‘dining room 
b. el jazwiswi ‘handwriting’ 
(24) a. munt.axrt (*mun.ta:rt) ‘dialect (lit. mouth type)’ 
b. ars.loefl (*ar.slæf) ‘ice spoor 
(25) a. omáða eryasias ‘work team (*omada eryasias) 
(cf. meydlia eryasia ‘big works’) | 
b. zóni asfalias ‘safety zone (*zoni asfalias) 


Italian s-voIcING, shown in (26), provides an example of an w domain span rule. It 
voices nongeminate [s] between vowels, as illustrated in (27) (Nespor and Vogel 1986, 
1997). The rule applies in a simplex word in (27a), in a suffixed word in (27b), but does 
not apply across a prefix and its base (27c) or a combination of a stem and a word 
(27d), even though all four items in (27) are single morphological words. The general- 


ization that brings this disparate group of morphosyntactic constituents under one 
heading is the w. 


VCYV 
(26) ITALIAN S-VOICING $-Z/,(... |. -Jo 
(27) a. 'kaza ‘house 
b. ka'zina ‘house + DIM? 
c. aso'tfale *azo'tfale ‘a-social’ 
d. 'filoso'vjetiko *filozo'vjetiko ‘pro-soviet’ 


A constituency effect of the Dutch w was noted by Booij (1985). Dutch coordinated 
NPs allow deletion of identical parts in the coordinated constituents. Schematically, 
the situation can be represented as AB and CB, where the slash marks the deleted 
item. Crucially, the deleted portion in the left-hand coordinate is not a morphological 
(or syntactic) constituent. This is illustrated in (28). The deleted B corresponds to the 
head noun of an NP in (28a), to a noun within a compound noun in (28b), to a verbal 
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stem in (28c) and to an adjectival suffix in (28d). While the deleted portions are quite 
heterogeneous when viewed from a morphosyntactic perspective, their common 
phonological characteristic is that they form separate syllabification domains, i.e. ws. 
Dutch does not generally syllabify across words, while all prefixes and certain suffixes 
like -schap and -achtig do not syllabify together with the base they are attached to, 
forming separate ws. 


(28) a. [[grote]4a; [maten]]yp en [[kleine] ,a; [maten]y] yp 
w( grote ), wl en )w wl kleine ),, œl maten ),, 
‘small (sizes) and large sizes’ 


b. [[minimum], [maten],], en [[maximum], [maten] y] y 
@(minimum),, olen)» .(maximum),, ,,(maten),, 
‘minimum (sizes) and maximum sizes’ 


c. [in [voer]y]y en [uit [voer]y]y 


wlin) wlen)o luit) (voer) 
‘im(port) and export’ 


d. [[zwanger] 4g schap]y en [[moeder]y schap]y 
w( Zwanger),, wl en )o .(moeder),, ,,(schap),, 
‘pregnant(hood) and motherhood’ 


The assumption that the deleted portion should minimally be a w is supported by the 
impossibility of deleting suffixes that do syllabify with the base. Dutch has two 
adjective-forming suffixes meaning ‘like’. The suffix -achtig [aytax] is like -schap in 
(28d), and forms its own w, but the suffix -ig [ax] is incorporated into the w of its 
base. Accordingly, deletion of -achtig is possible in (29a), while in (29b) no deletion 
is possible. 


(29) a. [[paars] 44 achtig] 4g en [[groen] 4g achtig] 4a 
w(Paars),, w(en), (groen),, ,,(achtig),, * (Xru.nax.tax),, 
‘purple(-like) and green-like’ 


b. [[paars] ai igh ag en [[groen] ag ig) agj 
wl paarsig ).. wl en )w wl groenig ),, *u(Xrun )owu(3X)w 
‘purple-like and green-like’ 


Before [r], the Dutch vowels [e:, ø:, 0:] are pronounced as centring diphthongs, i.e. 
[e’, o°, 0°} (Gussenhoven 1993). What is the domain within which MID-VOWEL 
CENTRALIZATION applies? 


[mesr ]y ‘more’  [[me: [reiz]y]yanly ‘with-travel-INF.’ 


(‘accompany’) 
[Xesr Jw ~ ‘smell’ [[murljo:]y [riziko:] 4] y ‘environmental hazard’ 
[[kø?r]y 19 ]w ‘test’ [fkg:]n [rinlnlw ‘cue ring’ 


[ko? rail] y ‘coral’ [nezo:[[re:v]y'a:ns] ag) adj ‘neo-Revian’ 
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Q127 In Dutch, prevocalic [s] is often voiced to [z] after voiced segments, as shown in 
(1). However, no voicing is possible in the examples in (2). What determines when 
[s] may be voiced? 


(1) [[hoeys], farts) dy [z] ‘family doctor’ 


[[vas] aux [a:rdaX] zal ve [z] ‘was friendly’ 
{['mens]y [ap] dy [z]  ‘orang-utan’ 
(‘mis [o:Xst] ly [z] ‘failed harvest (mis-harvest)’ 
[leis]y axtex] aaj [z]  ‘ice-like’ 
(2) fheis}yan]y *[z]  ‘hoist+INF’ 
[kanslwan]jw *[z} “chance+PLUR. 


[masa]lw *[z] ‘mass’ 


15.6 DERIVING PROSODIC CONSTITUENTS 


With the w we have come to the lowest prosodic constituent which can somehow be 
related to morphosyntactic constituency. What the w and higher constituents have in 
common is that at least part of their formation is dependent on the morphosyntactic 
structure of the language. The question of how the relation between the two kinds of 
constituency is to be expressed has received different answers in the literature. In 
Nespor and Vogel (1986), the relation is based on a variety of morphosyntactic proper- 
ties. A typical statement of such a relation might be ‘Include the head of the syntactic 
constituent S, together with all the prosodic constituents of rank C on its non-recursive 
side, in Prosodic Constituent of rank C+1’. (The non-recursive side is the left side in 
right-branching structures, and vice versa.) Selkirk (1986), with reference to a pro- 
posal in an earlier version of Chen (1987), suggests that the unifying element in the 
relation between prosodic and morphosyntactic constituency is reference to edges. Her 
` theory is known as EDGE-BASED PROSODIFICATION, and is given in (30). 


(30)  EDGE-BASED PROSODIFICATION: The right (left) boundary of a prosodic constituent C 
corresponds to the right (left) boundary of a morphosyntactic category X. 


The claim in (30) is that, to derive a prosodic constituent, all we need to know is which 
syntactic constituent it ‘co-begins’ or ‘co-ends’ with, so to speak. As has become clear 
from the discussion in the preceding sections, the role of nonsyntactic factors like 
length tends to be stronger as the rank of the prosodic constituent is higher, and we 
therefore illustrate the theory with a low-ranking prosodic constituent, the Dutch w. 
This constituent can be derived with the help of morphological information only 
(cf. Booij 1977: 103, van der Hulst 1984: 85). As is the case in Italian, suffixes are syl- 
labified with their base, but prefixes never are. This suggests that the w co-begins with 
the beginning of the morphological category ‘word’, i.e. any stem or derived word. 
This excludes prefixes and, as we will see in the next section, certain function words, 
like pronouns and prepositions. This is informally expressed in (31). This description 
predicts that constituents of the compound form individual ws, which is correct. 
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It will be clear that EDGE-BASED PROSODIFICATION is the precursor of Alignment Constraints 
in Optimality Theory. 


(31) œ: Lexical Category, Left 


In (32a) we see that separate syllabification domains are created for prefixes and stems. 
This is because the prefix begins a lexical category (the complex word) and so does the 
base. (The end of a nonfinal w is of course defined by the beginnings of the next.) 
Similarly, (32b) illustrates how separate domains are created for the constituents of 
compounds: each of them begins a lexical category, while the first, additionally but 
redundantly, begins the compound, another lexical category. In (32c) we see that suf- 
fixes are included in the w on the left, because suffixes do not begin lexical stems. 


(32) Morphology w-structure Syllabification 
a. [ont [eixan] yly w (Int), oelxan), ont.el.xan ‘dispossess’ 
[on [e:vən] aaj) Adj (ON) a ETD) on.et.von ‘uneven’ 
b. [[rein]y [atk] y Iw w(Tein),, 4(atk),, rein.atk ‘Rhine barge’ 
[[kerk]y [ey] y Jw w([kerk),, (el), kerk.ceyl ‘barn owl 
c. [[tetkon]y nly w(terkanin),, te:.ka.nin ‘drawing’ 
[[vandal] y air], (vandalair),, van.də.la:r ‘walker’ 


As pointed out by Inkelas (1989) and Booij (1996), prosodic structure, like any other 
aspect of the phonological representation of words or morphemes, can be included in 
underlying representations. As we saw in (28d) above, many Dutch full-vowelled suf- 
fixes are not included in the w on their left, like the nominalizing suffix -schap and the 
adjectival suffix -achtig, which do not syllabify with their base. In the lexicon, these 
suffixes will therefore be listed as ws. (It would, incidentally, not be difficult to give a 
phonological characterization of the suffixes that display this behaviour.) 


15.6.1 Clitics 


Pronouns, auxiliary verbs, conjunctions and the like cannot be given w status by (31) 
in Dutch, since they do not belong to a major word class. But since they are words, not 
affixes, they cannot attach to some other item in the lexicon that is a major-class item. 
Many function words are in fact included in w postlexically. If (31) is also valid 
postlexically, the prediction is that function words in Dutch should behave like suf- 
fixes, i.e. be included in the phonological word to their left. This is indeed what we 
find. In (33a) the article [an] encliticizes onto the preceding verb form [rip] ‘called’. 
This explains why the article cannot, in natural speech, be pronounced [?an] in this 


context. The same goes for the preposition [1n] and the definite article [at] in (33b) 
(Booij 1996). 


(33) a. [rip]y [an] art Hij ,,(ri.pon),, kat 
‘he called a cat’ 
b. [Xaitjv [In] prep [tare Het ,,(xat.ti.not),, putje 
‘it goes into the drain’ 
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Other function words, also those that lack an onset, have a full vowel, like the conjunc- 
tions [of] ‘or and [en] ‘and’. These words are at best only variably syllabified with the 
preceding word, and they will therefore have to be given w status in the lexicon, along 
with the suffixes that form their own @, like -schap. It has also been noted that certain 
function words and affixes behave neither as elements that are included in the same w 
as their host word nor as elements that form an w by themselves. Nespor and Vogel 
(1986) postulate the Clitic Group as a constituent between the w and the ¢ in order to 
account for phonological processes that occur between words and such recalcitrant 
morphemes, but which fail to occur in other morpheme combinations. An example is 
the prestressing of Latin -que ‘and’, discussed in Chapter 14. 


15.6.2 The syntactic residue 


The U, IP, p, œ and, perhaps, the Clitic Group are the prosodic constituents which define 
the relevant domains of processes that apply above the word level. Prosodic theory thus 
distinguishes itself from theories that claim that such rules can refer directly to syntac- 
tic structure, such as Kaisse (1985). Nevertheless, instances have been found of rules 
that apply across words, which do apparently refer to syntactic categories, as would 
appear to be the case for French LIAISON (Hayes 1990, Post 1997). Hayes (1990) proposes 
that such ‘residual’ syntax-sensitivity should be accounted for in the lexicon. That is, the 
phonological rules that produce the required forms are in fact lexical rules, and the 
forms they produce are thus available in the lexicon, ready for insertion into syntactic 
phrases. To give an example, the phonological rule that shortens final long vowels in 
Hausa verbs is syntactically conditioned: it only applies if a direct object that contains 
a major class noun immediately follows. This is illustrated in (34a), which contrasts 
with (34b), where the morphosyntactic condition is not met. 


(34) a. ná: ká:mà ki;fi: ‘I have caught a fish’ 
b. ná: ké:ma: fi ies “I have caught it 
na: karma: wa muisd: ki:fi: ‘I have caught Musa a fish’ 


The lexical rule is given in (35); its morphosyntactic conditioning is expressed by the 
‘Frame’ given below it. (Because the only category that can occur initially in a VP 
before an NP is a V, it is not necessary to label the word as a Verb in the rule.) Thus, 
when a verb is to be inserted in a sentence, the more specific form produced by FINAL 
VOWEL SHORTENING is chosen if the morphosyntactic condition applies. 


(35) FINAL VOWEL SHORTENING V > Ø / [. . . V 
Frame 1: [— NP . . .jvp 


— ] Frame 1 


The assumption that syntax-dependent rules are in fact lexical rules puts such altér- 
nations in a comparable position with phrasal allomorphy of the sort that is seen in 
the English indefinite article, which is [an] before vowels but [a] elsewhere. As the 
name suggests, phrasal allomorphs are rival phonological forms whose distribution is 
governed by properties of the surrounding words. There are, however, two differences 
between these two cases worth mentioning. First, the forms in (35) are generated by a 
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rule, because they involve a whole class of words rather than a single morpheme, and 
second, in (35) the conditioning is morphosyntactic rather than purely phonological. 
Hayes refers to forms like English [an] and Hausa [ka:ma] as precompiled, the idea 
being that they come ready-made from the lexicon. A prediction of this treatment is 
that rules like (35) might have exceptions. 


Q128 Demonstrate that the occurrence of English [an] is purely phonologically determined. | 


Q129 Why can syntax-dependent alternations have exceptions in Hayes’ theory of 
precompilation? 


15.7 PROSODIC CONSTITUENCY BELOW THE PHONOLOGICAL WORD 


Below the w, the prosodic hierarchy could be argued to continue with the foot (F) 
and the syllable. However, these constituents cannot, as a rule, be derived from mor- 
phosyntactic constituency, and are thus only subject to phonological well-formedness 
conditions. Another difference with the prosodic constituents discussed so far is that 
the SLH (7) does not apply with the same force to constituents below the word. In par- 
ticular, stray monosyllabic syllables need not be included in foot structure, as we saw 
in Chapter 14. In English an initial, open prestress syllable like [ta] in Tamara neither 
belongs to the foot mara nor, in view of its obviously unstressed character, forms a foot 
by itself. As as result, the syllable is directly attached to the w node, as shown in (36), 
a clear case of nonexhaustiveness (Pierrehumbert and Beckman 1988: 148). In Dutch, 
such initial prestress syllables remain unfooted regardless of whether they are closed 
or not, so that both [mi'nyt] ‘minute’ and [kan'to:r] ‘office’ have an unfooted initial syl- 
lable (Gussenhoven 1993). 


(36) > 


tə md: rə 


Likewise, ambisyllabic representations can be interpreted as involving improper 
bracketing: the ambisyllabic consonant is in the coda of one and in the onset of another 
(Kiparsky 1979, Jensen 1993). Even if the violation occurs below the level of the syl- 
lable, one might argue that the syllable is at least ‘guilty by association’. On the other 
hand, the relation between segments and syllable affiliations might be viewed as being 
in the ambiguous transition zone between ‘association lines’ (which routinely enter into 
many-to-one linkings) and the branches of tree structures that indicate constituent 
membership. There thus seems no good reason for rejecting ambisyllabic representa- 
tions, certainly when considering the advantages of this representation in Chapter 10. 
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Indeed, in addition to the postlexical ambisyllabicity in English dealt with in that chap- 
ter, there are further considerations for assuming that ambisyllabicity may even exist in 
lexical representations. Dutch and German foot structure, for instance, leaves no doubt 
that intervocalic single (superficially non-geminate) consonants, like [t] in Dutch 
[kon'feti] ‘confetti’, close the syllable on their left (as well as open the syllable on their 
right). One reason for assuming they are in the coda is that the main stress never falls 
to the left of a closed penultimate syllable, and that the penultimate syllable of a word 
like [kon'feti] behaves just like a closed penultimate syllable in a word like [a'.las.ka:] 
‘Alaska’ in this respect (*['a:.las.ka:], *['konfeti]). In these cases, the consonant is 
ambisyllabic in representations that exist ‘before’ foot structure is erected and crucially 
contributes to the weight of the syllable (van der Hulst 1985). 


15.8 CONCLUSION 


Phonological rules that apply above the level of the word appear to be constrained 
by prosodic constituents that form a hierarchical structure which is not directly 
mappable onto the morphosyntactic structure. We have seen that these prosodic 
constituents not only define the domains at or within whose edges phonological 
adjustments occur, but are also constituents that are referred to by rules of syntax. Just 
as syllabic and foot structure is available in the lexicon (cf. Chapter 8), so the prosodic 
structure appears to be visible to the syntax. That is, morphosyntactic structure exists 
simultaneously with the phonological structure. The distinction between lexical and 
postlexical rules, which separates phonological rules that can refer to morphological 
information and which potentially have exceptions from those that can only refer to 
phonological representations and which cannot have exceptions, could be maintained 
above the level of the word by assuming that phonological rules for which syntactic 
information is relevant are in fact lexical rules. Such syntax-sensitive rules only appar- 
ently apply above the level of the word, and actually produce the appropriate alter- 
nants in the lexicon. During the construction of the sentence, these precompiled forms 
are inserted in the specific contexts instead of the rival, more general alternant. 

Prosodic structure is considerably more orderly than syntactic structure. The Strict 
Layer Hyporuesis, which forbids improper bracketing, recursivity and non-exhaustiveness, 
constrains the prosodic structure from the w onwards. The phonological constituents 
below the w are not usually derivable from morphosyntactic structure, and may 
deviate from these well-formedness requirements in limited ways. In particular, not 
all syllables need to be included in foot structure, while consonants may be simulta- 
neously part of two syllables, a representation known as ambisyllabicity. 
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Writing a textbook can be a dangerous enterprise. Especially in a fast-developing dis- 
cipline such as phonology, there is always the risk that the book is outdated at the 
moment of publication. We have tried to give our textbook a longer life not only by 
providing an overview of the state of the art but also by presenting the arguments that 
have led to changes in theoretical frameworks, while constantly trying to keep in mind 
the kinds of question that a beginning student of phonology might ask. By trying to 
make this book meet the beginning student’s needs, we have also felt free to deviate 
from a strict chronological treatment of theories. For instance, at an early point, when 
we were dealing with the nativization of loanwords, we introduced the latest version of 
phonological theory: Optimality Theory. This same didactic goal characterizes most 
of the exercises. Rather than rehearsing the material of each chapter, they are designed 
to make students sensitive to linguistic arguments, and more often than not present 
new data and new questions. 

The history of generative phonology can be divided into three main periods. Starting 
with the idea that sound can be linearly sliced up into discrete matrices of segments, the 
main emphasis of linear phonology from the early 1960s has been on the exact formu- 
lation of rules that manipulate strings of segments. Chapters 6 and 7 present an 
overview of this linear framework of phonology. As so often in the development of sci- 
ence, the joint effort of many researchers quickly led to new insights. Starting in the late 
1970s, phonologists have convincingly shown that phonological theory is in need of an 
enriched representation of phonological structure. The replacement of linear phonology 
by nonlinear models greatly increased our understanding of phonological structure and 
at the same time reduced the number of possible phonological rules. As such, it offers a 
more adequate theory of possible phonological processes. The developments and argu- 
ments that led to nonlinear phonology form the topics of chapters 9-15. In the early 
1990s, when nonlinear models were being consolidated, nonlinear theory appeared well 
equipped to return to the issue of how to deal with phonological adjustments. 

During the first two periods of its history, phonology remained derivational: rules 
changed underlying structures into surface structures. With the advent of Optimality 
Theory that picture, too, changed. The phonological rule itself has been abandoned and 
replaced by a universal set of constraints that may be ranked on a language-specific 
basis. The ranking of constraints decides what will be the surface structure for any 
given input form in a given language. 
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Ourhopeis that students who have studiedthis book will feel we have Succeeded in 
giving them a sound understanding of phonology, and will have come to share with us 
the excitement of realizing that nontrivial questions about sound structure can be 
formulated and answered. We also hope that it will have provided them with the right 
kind of background wherever scientific questions have a bearing on the sounds of lan- 
guage, as will often be the case in psycholinguistics, speech technology, neurolinguis- 
tics and sociolinguistics. And, fortunately, for those who want more there are excellent 
further textbooks available: Phonology in generative grammar by Michael Kenstowicz, 
The handbook of phonological theory, edited by John Goldsmith, John McCarthy’s 
A Thematic Guide to Optimality Theory and Optimality Theory by René Kager. 
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Chapter 2 


Q1 In unscratchable there is a verb scratch, the base of scratchable. In road tax 
increase there are the nouns road, tax, increase, road tax and road tax increase. That is, 
its structure is: 

will road ]wx[ tax ]yly nl increase ]y]y. 


Q2 There are six NPs: He, the letter, a pen he bought in Italy, a pen, he and Italy. In 
addition there is the deleted NP that (A pen that he bought in Italy), a relative pronoun 
that takes the place of the Direct Object of bought, viz., a pen. 


Q3 If the vocalizations that represent morphemes did not have structure, there 
would be no segmental structure, and consequently there would be no segments that 
could be switched round. For the relevance of speech errors for phonological struc- 
ture, see Levelt (1989: ch. 9) and Shattuck-Hufnagel (1992). 


Q4 Yes, sign languages for the deaf have morphosyntactic structure as well as phono- 
logical structure, just like spoken languages. The difference is that the medium in which 
the phonological units are expressed is not human vocal sound, but gestures. There are 
many sign languages in the world, and there is no genetic relationship between the dom- 
inant sign language and the dominant spoken language in any community. They are 
entirely comparable functionally and in terms of processing speed. A person who is deaf 
at birth and does not learn.a sign language will be linguistically and cognitively deprived 
in the same way as any hearing person would who is artificially prevented from learning 
a spoken language. The haunting story of the slow pace with which this insight spread to 
the community of therapists is told in Sacks (1989). It is only recently that research into 
the morphosyntactic and phonological structure of sign languages has got off the ground. 


Q5 
1. The word for ‘field’ begins with [6] because according to FRICATION a word-initial 


plosive like [d] is not tolerated when the word occurs in non-initial position in 
the sentence. 


2. ‘Cows are grazing in the field’ in language II is [ni xaputi fula]. 


3. Since fricatives would uniquely occur at the beginning of non-initial words, 
any occurrence of a fricative would signal the beginning of a word. Detecting 
the word boundaries in speech is a nontrivial task for the hearer. It must be 
realized that there are no pauses between words, as should be readily evident 
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to anyone listening to unfamiliar languages. For more information on the rele- 
vance of phonological structure for parsing strategies, see Cutler (1995) or 
McQueen and Cutler (1997). 


Q6 [çibatfi]; [tswkemono]; [ifi]. The devoicing of the close vowels amounts to their 
deletion, but notice that the effect close vowels have on the articulation of [t,s] is pre- 
served. In Chapter 6 the issue of rule ordering will be discussed. 


Q7 Syllables Morphemes 
[e], [lo], [fant] ` [elafant] 
[parm, [oul] [pa:m, [oil] 
[an], [set], [lin] [an], [setl], [19] 

Q8 


1. No, [ikt] is a well-formed syllable, and occurs as a rhyme in licked, nicked, 
strict, etc. There just so happens not to be a morpheme with that shape. 


2. The sequence [mrat] does not represent a well-formed English syllable. 


1. (C(C))V(V). 


2. Yes, as in the second syllable of ‘lizard type’, for example. 


1. To answer the question for our own native language, English has [6], which is 
absent from Dutch, while Dutch has [y], unlike English. 


2. The final consonant of German Kiev will be [f] because of FINAL DEVOICING. 


1. The total number of segments in !X@ is very large indeed. 


2. The number of vowels in Pawaia is large compared with the small number of 
consonants. 


3. The number of vowels in Haida is small compared with the large number of 
consonants. 


4. While the number of consonants of Norwegian is close to the world average, 
the number of vowels is rather large. 


Q12 The smaller the number of segments, the smaller will be the number of contrasts, 
and the sooner the need arises to add further segments. For the same reason, binary 
numbers (numbers that consist of 1 and 0 only) will be longer than decimal numbers as 
soon as their value exceeds 2. 


Q13 1. The voiced uvular stop will be most likely to be absent, because the space 
between the uvular closure and the glottis is even smaller than that between the velar 
closure and the glottis. 

Q14 


1. Your graphs should look like two parallel inverted Vs, with the lower one rep- 
resenting the voiced plosives. 
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The [p] and the [g] have lower positions than in the theoretical graphs. The rea- 
sons for this must be the same as those that were given for the relative frequency of 
the system gaps for [p] and [g] in section 2.4.4: the first is relatively hard to hear 
because of the absence of a resonance chamber in front of the place of articulation, 
while the second is relatively hard to produce because of the rapid increase in air 
pressure in the small chamber between the velar closure and the glottis, which will 
impede the air flow through the glottis needed to allow the vocal folds to vibrate. 


Chapter 3 


Q15 *[tu?a] contains the illegal segment [t], *[plai] has an illegal CC onset, while 
*[kehunanal] has an illegal coda in the last syllable. 


Q16 *[men] does not have the required two syllables, while *[le?em] ends in [m], 
which is not allowed in Konjo codas. 


Q17 Before [w], [t] affricates to [ts]. If [w] were used, therefore, the word for ‘toast 
would be [to:swtsui], which output is less similar to the English word than [to:swto]. 
(There are exceptions to this strategy, since ‘donut is [do:natsw].) 


Q18 


ls 
2. 


Q19 


The only consonant that can appear word-finally in Japanese is [n]. 


The consonants that can appear in the coda of non-final syllables are [m,n,n]. 
(Japanese also has intervocalic geminate consonants, which simultaneously 
function as coda of one syllable and onset of the next.) 


A vowel is added after the word for‘rubber’, but not after the word for ‘manganese 
because word-finally only [n], not [m], is allowed as a coda consonant. 


On the basis of these data, [s] and [h] can be said to be the voiceless fricatives 
of Japanese. 


The fact that the onset clusters [bl-] and [dr-] in the words for ‘tin’ and 
‘hypochondria are broken up into separate syllables suggests that Japanese 
has no complex onsets other than C[j]. 


. The onset clusters [br-]and [fl-] and the coda cluster [-It] are broken up into 


different syllables. 


Clusters that are not broken up, that is, are not fully interpreted, are the onset 
clusters [st-] and [sk-] and the coda clusters [-nd] and [-nt]. 


Hawaiian [p] is used for English [b,f,p]. These consonants are all articulated 
with the lips. 


Hawaiian [k] is used for English [t,d,s,f,z,0,k]. These consonants are all artic- 
ulated with the tongue. 


We feel uncertain of what the vowels would be, but the consonants would be 
[p,l,k]. [palaki] and [poloka] are among the possible forms. 
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Q20 


Q21 


Q22 
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The sound [s] is not part of the segment inventory of the language, which is 
why it is replaced by a native segment. It is improbable that the fully native 
speaker of Hawaiian cannot hear the difference between [s] and [k]. Speakers 
may feel that the difference is somehow not important, much in the same way 
as speakers of Dutch or German can hear the difference between English bet 
[bet] and bat [bæt], but nevertheless pronounce both as [bet]. 


. The constraint in (11), Max-IO (deletion of segments is prohibited) is respon- 


sible for ruling out [kis]. 


Max-IO has to be ranked above Dep-IO because otherwise [kıs] would become 
the optimal output form. 


[io [io 
ei 
tier | fe 


Vice [1 [ana 


. Dep-IO is not crucially ranked above CopaDonp, as the same form, [le?en] 


would still be the optimal output form. 


| tem | ALIGN-STEM-RIGHT 


| | 
ml a 
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2. Yes, CopaConn is crucially ranked above Ipent(F), because the reverse ranking 
would produce [le?em] as the optimal output form. 


Chapter 4 


Q23 
1. In Southern Swedish, [r] and [r] are allophones of the same phoneme, because 
their distribution is determined by the phonological context. 


2. In Dutch, they are stylistic variants, because their distribution is determined 
by the formality of the situation. 


3. In Provencal, they are different phonemes, because they can be used to form a 
minimal pair. 


Q24 In Tolitoli, the retroflex lateral flap [L] occurs after back vowels ([0,u,a]), 
whereas [I] occurs after front vowels ([i,e]) and word-initially. 


Q25 The flap [r] occurs in the onset, whereas [1] occurs in the coda in Korean. 


Q26 Old English [ç] occurs after front vowels (in the examples, [i,i:,e:,2:]), whereas 
[x] occurs after the back vowels (in the examples, [u:,o1,9,a:], and [a] as the second 
element of a diphthong). 


Q27 Sew and sewage are unrelated, both semantically and phonologically. Blow and 
blew are arguably derivable from the same UR, but since the number of forms that 
would go by the rules that required to transform [blou] into [blu:] is very small (cf. 
grow, know, throw), it would seem more reasonable to list the past form. For a deriva- 
tional description, see Halle and Mohanan (1985). The same consideration applies to 
the different verbal and nominal stems -ceive and -ception. There is not even a remotely 
reasonable case to be made for brother — fraternal (cf. the different phonological rela- 
tions between mother - maternal and father - paternal). Cork - corkage are regularly 
related phonologically via attachment of the nominal suffix [1d3], though the form 
needs to be listed in the lexicon because of its specialized meaning ‘fee to be paid for 
opening the guest’s own wine in restaurants without a licence to sell wine’. The forms 
talk and talked indubitably contain the same UR: the phonology is regular and the 
semantics of the inflected form is predictable from its morphological structure. 


Q28 
1. Iflist were [lis] underlyingly, its present participle would be *['listn] instead of 
the correct [TstI]. 


2. Its past participle form would be [list], produced by pevoicine of [d] after the 
voiceless obstruent [s]. If [list] is assumed as the UR,3-INSERTION would insert a 
schwa between the suffix [d] and the stem, preventing DEVOICING from applying. 


Q29 


1. The alternants of each prefix in Balantak are [Sa-], [sam-], [san-], [san-] and 
[to-], [tom-], [ton-], [ton-]. 
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After [sa-], the initial segment of the base is [w,1,n,ņ,m,r], and after [to-] it is 
[j,w,l,n,r]. mm 

After [sam-], the initial segment of the base is [b], and after [tom-j it is [b,p]. 
After [san-], the initial segment of the base is [t,s], and after [ton-] it is 
[t,d,s]. 

After [san-], the initial segment of the base is [k,g], or a vowel, just as it is after 
[ton]. 


The underlying forms must be /san, ton/. 


A deletion rule is required to delete [1] of the prefix before a base starting with 
a sonorant consonant ((n,p,m,l,r,j,w]) and a place assimilation rule is 
required to assimilate the place of articulation of the prefix-final [1] to that of 
the base-initial consonant. Before vowels nothing needs to be done, which 
would not be the case if different URs were chosen. With /tom, sam/ or /ton, 
san/, the assimilation to [9] would have to be effected in the unnatural context 
‘before dorsal consonants or a vowel’. If /sa, to/ were chosen, an insertion rule 
would be required to insert [ņ] before the unnatural context of ‘obstruents and 
vowels’. 


Chapter 5 


Q30 


1. 


The answer to the first three questions is [p,t,k]. 


2. Evidently, [p,t,k] form a natural segment class. 


Q31 


1. 


Q32 


With two binary features F and G, eight natural segment classes can be referred 
to. Four classes are defined by using one binary feature with each possible 
value: +F, —E +G, —G. An additional four are defined by using two features 
with every possible value combination: +F+G, +F—G, —F+G, —F—G. 


The combination of [十 cons, 一 Son] defines the class of obstruents ([p,f,t], etc.), 
[十 cons, 十 Son] defines the class of sonorants ([n,r,l], etc.), [—cons, —son] 
defines the class of laryngeal segments ([h,fi,?] and [—cons,+son] defines 
the class of glides ([j,w,v], etc.). 


In Dutch, [a] is inserted between the noun stem and the diminutive suffix, 


when the noun stem ends in a [十 cons, 十 Son] segment, that is, after the class of sono- 
rants excluding glides and vowels. If we referred to only [+son], the glide [j] would 
wrongly be included. 


Q33 In Cordoba Spanish, the assimilation process creating geminates applies to 
[ +cons] segments. Glides and laryngeal segments, | — cons] segments, do not trigger it. 


Q34 The class of sounds subject to degemination in Dutch can be characterized by 
the feature [+ cons]. 
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Q35 All obstruents are voiceless at the end of a syllable in Dutch. This class can be 
characterized by the features [+ cons, —son]. Since [h] cannot occur word-finally in 
Dutch, the specification [一 son] would suffice. 


Q36 


[二 son] 


m n 1 


[一 approx] 


[十 approx] 
[十 cons] 


[— cons] 


Q37 Segments that may appear after [sp,st,sk] in English, [l,r,j,w] and the vowels, can 
be characterized by the feature [十 approx]. 


Q38 The plosives acquire the feature [+ constr]. 


Q39 In Southern Oromo, the combination of the features | +constr, — voice, + cons] 
is necessary to distinguish [t’] from the three consonants that do not trigger 
i-EPENTHESIS. The feature [+constr] is necessary to distinguish ejective [t’] from 
[t], [一 voice] is necessary to distinguish it from [d] (which is [+ constr] as well), 
and, finally, the feature [+cons] is necessary to distinguish [t’] from the glottal 
stop [?]. 


Q40 The class of segments before which the long vowels appear in Scottish English 
can be characterized as [+ cont, 十 voice]. 


Q41 
1. The devoicing applies to [b,d,g,d3], but not to [z]. 


2. The group of plosives and affricates is [—cont]. Fricatives are [+ cont]. 


Q42 The class of vowels undergoing the nasalization process in Corsican can be 
referred to by the feature [—high] as the class of non-high vowels. 
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Q43 The class of consonants triggering r-DELETION in Bengali can be characterized by 
the feature [corona]. 


Q44 The feature [back] is involved in the uMLAUT process in the Dutch dialect of 
Wehl: it is [一 back] in the plurals. i 


Q45 
1. The complementary distribution of [r] and [I] is determined by the preceding 
vowel. The [I] occurs after a back vowel and [r] occurs after a front vowel. 
[I] also occurs word-initially. 


2. The context of [r] can be stated as ‘after a vowel with the feature [—back]’, 
whereas the context for [1] cannot be expressed in terms of distinctive features. 
Although the word-internal occurrence of [1] can be expressed as ‘preceded by 
a vowel with the feature | +back]’, there is no feature to refer to word-initial 
position. 


3. If/l/ is chosen as the underlying representation a single rule can account for the 
surface complementary distribution pattern: /l/ becomes [r] after a front vowel. 


1. [+spread] 

2. [CORONAL, — cont, — son] 

3. [LABIALD, 十 voice, 一 cont] or [LABIAL, 十 voice, 十 cons] 
4 


. [一 cons] ([+son] is superfluous, as there are no laryngeal segments in the 
language.) 


5. [十 back, —high] 


1. [+cont,—son] (Recall that [h,hw ] are not specified for [+ cont], p. 67.) 

2. [十 constr] 

3. [CORONAL, — anterior, — son] 

4. [coRoNAL, 十 nas] 

5. [十 back, 十 round] (We must not include the rounded consonants, and assume 


the class of V or [+syll].) 


Chapter 6 


Q48 Rule (6) applies to both [d]s. At the first application, the first [d] corresponds to 
the focus [—son] and the second to Co and at the second application the second [d] 
matches the focus [— son], while the term Co corresponds to no consonant. 


Q49 Rule (1) says ‘Insert a schwa before a word-final postvocalic [r]’ and rule (2) 
says ‘Delete [r] before consonants or at the word end’, 
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Q50 Your answer should consist of four copies of rule (10). In one of them, a and 
B are replaced with + and +, respectively, in a second with + and —, ina third with 一 
and +, and in the fourth with — and —. 


Q51 


L 


Q52 


Q53 


The rule says that word-final mid vowels are raised, that is, word-final [e] and 
[o] become [i] and [u]. 


. This rule states that a front vowel is deleted before another vowel, if the two 


vowels have the same height. That is, [e] is deleted before [e] and [o], [i] is 
deleted before [i] and [u] and [æ] is deleted before [æ], [a] and [9]. 


. The rule says that a coronal voiceless stop is inserted between [n,n] (a coronal 


nasal) and [s, f] (a coronal voiceless fricative) in word-final position. That is, 
word-finally, [t] is inserted between [n] and [s] and [c] between [p] and [f]. 


. [-son] 一 [—voice] / [—son] (# #) 一 


E a| +cons |- le 


+approx —son 


+high 
V = [aback]/ Feel Co 十 一 


Instead of just Co also Co), is correct, but the reference to the syllable is implicit 
in Cp, since it stipulates that no further vowels may appear after the focus V of 
the rule. 


vaim vain 
N-DELETION = 3-INSERTION 3 
3-insertion 9 N-DELETION ø 
Output [vairan vaira] 


. underlying forms morphemes 


we 3rd-PL-OBJ (‘them’) 
n PROGRESSIVE 
o? 3rd-SG-SUBJ (‘he’) 
0 3rd-SG-OBJ (‘it has no phonological form) 
netale ‘lick 
picena ‘cut’ 
Inetale+o?/ = /we+picenat+n+o?/ /we 十 picena 十 o?/ 
CONTRACTION netleo? wepcenano? wepcenao? 
TRUNCATION netle? n.a. wepcena? 
[netle?] [wepcenano?] [wepcena?] 


‘he licks it ‘he is cutting them’ ‘he cuts them’ 
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Q55 


l: 


Only the last order derives the correct surface forms [jomo] and [nguja]. 


Q56 


l. 


Key to questions 


Rule 1 could be termed VOICED PLOSIVE DELETION, rule 2 POST-NASAL VOICING and 
rule 3 PLACE ASSIMILATION, but more or less explicit names would also be 


correct. 


(1) VOICED PLOSIVE DELETION 
(2) POST-NASAL VOICING 
(3) PLACE ASSIMILATION 


(1) VOICED PLOSIVE DELETION 
(3) PLACE ASSIMILATION 
(2) POST-NASAL VOICING 


(2) voICING 
(1) VOICED PLOSIVE DELETION 
(3) PLACE-ASSIMILATION 


(2) POST-NASAL VOICING 
(3) PLACE ASSIMILATION 
(1) VOICED PLOSIVE DELETION 


(3) PLACE ASSIMILATION 
(2) POST-NASAL VOICING 
(1) VOICED PLOSIVE DELETION 


(3) PLACE ASSIMILATION 
(1) VOICED PLOSIVE DELETION 
(2) POST-NASAL VOICING 


PROGRESSIVE DEVOICING should devoice fricatives after a voiceless obstruent and 
REGRESSIVE VOICING should voice obstruents preceding a voiced plosive. The order 
depends in part on how explicit the formulation of the rules is. With the least 
explicit formulations, the order iS FINAL DEVOICING, PROGRESSIVE DEVOICING, REGRESSIVE 


VOICING and DEGEMINATION. 


/n+gomo/ 
nomo 

n.a. 

n.a. 

* [nomo] 


/n+gomo/ 
nomo 

n.a. 

n.a. 

* [nomo] 


/n+gomo/ 
n.a. 

nomo 

n.a. 

* [nomo] 


/n+gomo/ 
n.a. 
ngomo 
nomo 
[pomo] 


/n+gomo/ 
ngomo 
n.a. 

nomo 
[pomo] 


/n+gomo/ 
ngowo 
nomo 

n.a. 
[pomo] 


/n+kuja/ 


nguja 
[pguja] 
/n+kuja/ 
n.a. 
nguja 
nguja 
[pguja] 
/n+kuja/ 
nguja 
nuja 

n.a. 

* [nuja] 


/n+kuja/ 
nguja 
nguja 
Duja 

* [puja] 
/n+kuja/ 
pkuja 
nguja 
guja 

* [puja] 
/n+kuja/ 
Dkuja 
nkuja 
nguja 
[pguja] 
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Berd — [-voice] / [—voice](#) 一 


a PROGRESSIVE DEVOICING | 


‘ —cont 
REGRESSIVE VOICING: [—son] 一 [十 voice] / — | 
It is unnecessary to specify the focus of PROGRESSIVE DEVOICING as | + voice], since any 
[一 voice] fricatives would retain their specification as [一 voice]. That is, the rule 
would apply without effect, i.e. vacuously. Second, in the right-hand context of 
REGRESSIVE VOICING, instead of [一 cont] also [— son] could be specified, if we assume 
that PROGRESSIVE DEVOICING applies first, which rule leaves only voiced plosives 
behind in the context concerned. 


3: /vand#te:yal/ /lup#zæyvər/ /ka:z#za:k/ 
FINAL DEVOICING vant#teryal n.a. ka:s#zatk 
PROGRESSIVE DEVOICING n.a. lup#sceyvar kars#satk 
REGRESSIVE VOICING n.a. n.a. n.a. 
DEGEMINATION van#te:xal n.a. ka:#satk 
[vanteryal] [lupsceyvar] [ka:sa:k] 
/kop#bal/ /le:z#bril/ 
FINAL DEVOICING n.a. le:s#bril 
PROGRESSIVE DEVOICING n.a. n.a. 
REGRESSIVE VOICING kob#bal le:z#bril 
DEGEMINATION ko#bal n.a. 
[kəbal] [le:zbril] 


Q57 In Kaatsheuvel Dutch, the diminutive form of the river Linge will be [linaka], 
given that the final schwa, contrary to the epenthetic schwa, is already present when 
S-INSERTION (37) applies. 


Q58 The rules FLAPPING and pRE-FORTIS-CLIPPING stand in a counterbleeding order. The 
reverse order would decrease the number of forms to which pre-FoRTIS-CLIPPING (24) 
could apply. 


Chapter 7 


Q59 
1. The diminutive form of [flam] is [flamoca]. If PLACE ASSIMILATION applied 
before 9-insertion, [flam+tjo] would incorrectly end up as [{flampja], as 


shown. 

/flam+tjo/ /flam+tjo/ 
9-INSERTION flam+atja PLACE ASSIMILATION flam+pjo 
DEGEMINATION - 9-INSERTION - 
t-DELETION - DEGEMINATION 一 
PLACE ASSIMILATION - t-DELETION = 
PALATALIZATION flam+əcə PALATALIZATION 


Output [flamaca] *[flampja] 
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2. The diminutive form of [fest] is [fe:fə]. If t-pELeTIon applied before DEGEMINATION, 


/ferst +tja/would incorrectly end up as [fe: fca], as shown. 


/fe:st +tja/ /ferst +tja/ 
-INSERTION 一 3-INSERTION - 
DEGEMINATION fe:s+tjo t-DELETION fe:st+jo 
t-DELETION fe:st+jo DEGEMINATION - 
PLACE ASSIMILATION - PLACE ASSIMILATION - 
PALATALIZATION fe:fa PALATALIZATION fe:fca 
Output [fe:fa] *[fe: fca] 
3; /hemt+tjə/ /hemt +tjo/ 
9-INSERTION - 3-INSERTION 一 
DEGEMINATION hem@+tja PLACE ASSIMILATION vac. 
t-DELETION - DEGEMINATION hem+tjə 
PLACE ASSIMILATION hem+pjə t-DELETION - 
PALATALIZATION - PALATALIZATION hemca 
Output [hempjo] *[hemca] 


Q60 Given that the laterals [1] and [A] in Sittard Dutch do not trigger k-rrontiNG, they 
should be considered [+ cont] segments. If they were [— cont], they would trigger k- 
FRONTING. . 


Q61 The difference between Wehl Dutch and the standard language is that in Wehl 
Dutch 9-INSERTION apparently only applies if the sonorant in the structural description 
of the rule (cf. rule (6)) is [coronat]. 


Q62 
1. In the dialect of Utrecht, the alternants of the diminutive suffix are [-yi], 
[-oxi], [-i], [-si], [-tsi], [-pi] and [-ki]. 


2. All the rules in (13) (3-INSERTION, DEGEMINATION, t-DELETION and PLACE ASSIMILATION) 
except PALATALIZATION exist in Utrecht Dutch. 


3. The required rules are arrrication, by which [t] becomes an affricate [ts], and a 
rule by which [t] becomes [x], which we will term FRICATION. If FRICATION is 
ordered between 9-INSERTION and DEGEMINATION and if AFFRICATION is ordered last, 
the rules can be formulated as follows: 


FRICATION [t] > [xy] /V—V 
AFFRICATION [t]—> [ts]/ — i] yy 


4. ei-ti rok-ti bom-ti raim-ti boxt-ti fut-ti ~ bort-ti 
9-INSERTION 一 = bəm-əti - 一 - - 
FRICATION el-Xi - bam-axi - - - - 
DEGEMINATION - - = 一 box-ti fu-ti bor-ti 
t-DELETION - rək-i - - box-i - - 
PLACE - - - raim-pi - - 一 


ASSIMILATION 
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AEFRICATION - - - - ee futsi bortsi 
Output [sixi roki bmaxi raimpi bəyi futsi bortsi] 


Q63 Sittard Dutch [benca] could be derived from /bepc-ka/, /bent-ka/, 
/bep-ka/, /banc-ka/, /bant-ka/ or /ban-ka/. The last three possibilities will be identical 
to the first three after UMLAUT (rule (15)), while the further derivation will proceed as 
illustrated in (19) for /men-k3/ and in (20) for /lepc-ka/ and /tent-kə/. It could not be 
derived from /ben-kə/ (or /ban-ka/), because n-AssIMILATION bleeds k-FRONTING, as illus- 
trated in (19) for /pan-ka/. 


Q64 1. S-INSERTION needs to apply before 3-INSERTION, because otherwise /slan-ko/ 
would incorrectly end up as *[slanaka] instead of [slanaska]. Vowet Laxinc needs to be 
applied after 3-INSERTION, because otherwise /o:r-ka/ would first become /or-ka/ and 
then be subject to 9-INSERTION, ending up incorrectly as *[oraka] instead of [ərkə]. 


2: /slan-ka/ /snor-ka/ /our-ko/ 
S-INSERTION slan-ska 一 - 
3-INSERTION slan-aska snor-aka 一 
VOWEL LAXING 一 - or-ka 
Output [slanasko] [snoraka] [arka] 
Chapter 8 
Q65 


1. The lexical representation of ‘breakout is [œytbra:k], the form prior to the 
application of postlexical REGRESSIVE VOICING. 


2. Because it is a postlexical rule, REGRESSIVE VOICING would not be expected to have 
exceptions, which prediction is correct. 


3. Native speakers would say the last consonant of the prefix is [t], a judgement 
which they are expected to base on the lexical representation. 


Q66 
1. PLAIN PLOSIVE VOICING is not structure preserving, because its products, voiced 
plosives, do not belong to the lexical segment inventory. 


2. AFFRICATION is a structure-preserving rule, because [tf,t{"] already occur in the 
lexical segment inventory. 


3. PALATALIZATION is not structure preserving, since, again, none of the consonants 
[c,ch,c’,f,f”,n,A] appear in the lexicon. 


4. Properties that are consistent with the status of AFFRICATION as a lexical rule are (a) 
its inability to apply across word boundaries, given that the righthand context is a 
suffix, and (b) the fact that it is structure preserving, while (c) the fact that it refers 
to morphological information necessarily implies that it is a lexical rule. 


Q67 
1. (1) VOWEL NASALIZATION V—[+nas]/___n), 
(2) n-DELETION n—9/[V,+nas]__), 
(3) 9-DELETION 97> 6/_ X) 
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fin fin-a 
VOWEL NASALIZATION é 一 
N-DELETION Ø 
9-DELETION 一 Ø 
Output [fg fin] 


. If [n] resyllabifies as an onset in the syllable [9m] after voweL NASALIZATION, but 


before n-DELETION, the vowel will be nasalized, but [n] be preserved. The resyl- 
labification was earlier given as LIAISON. 


. n-DELETION must be postlexical, because it must apply after words, in this case [yn] 


(or rather its lexical representation [&n]) and [om], combine into phrases. 


. 9-DELETION must be postlexical, because, again, information about the following 


word is needed to decide whether it is deleted. In this case, [9] does not delete 
in the context [d] ___ [d]. This failure of 3-DELETION between identical conso- 
nants can be seen as an OCP effect: if [ə] did delete, a sequence of identical 
consonants would be created. 


. VOWEL NASALIZATION must apply before a word-final [n] is resyllabified into a fol- 


lowing onsetless syllable. Therefore, if it is postlexical, it must be ordered 
before LIAISON. This would be unexpected, as generalizations would normally 
refer to the prosodic structures that are present at the relevant level of repre- 
sentation. As it happens, the rule has exceptions, such as the loanwords and 
proper names abdomen [abdomen], pollen [polen], Citroen [sitroen], Le Pen 
[la pen]. 


. We could either assume an UR [əndə], which will be converted to [5d] by a ‘free 


ride’ on the grammar that we need anyway, or we could assume [5da] as the 
UR, which is possible because nasalized vowels are produced by a lexical rule 
and must by implication belong to the lexical segment inventory. If we go by 
the rule-of-thumb that we do not make URs more abstract than they need to 
be, the second option would be preferred. Perhaps psycholinguistic research 
may some day tell us whether this question is meaningful. 


No, the rule applies freely to all forms, also to the underived form /pad/ ‘toad’. 


. The rule that produces [e:] is lexical, because it must refer to the morphologi- 


cal status of [d] to be able to distinguish made and stayed, for instance. 


. The rule is not neutralizing, since the product of the rule, [e:], does not already 


appear independently in other words. 


. Itis not structure-preserving, for the same reason. However, because the rule is 


evidently lexical, there is the prediction that [e:] could readily be incorporated 
in URs, in loanwords, for instance. 


. The rule does not show the effect of Non-Derivep ENVIRONMENT BLOCKING, 


because it readily applies to a morphologically non-complex form like day. 
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Q70 In unguarded speech, RP cents and sense would be expected to be homophonous, 
since the phonological representations are the same. Phonetic implementation rules are 
not expected to take account of the history of the phonological surface representation ` 
they are required to translate into phonetics. Nevertheless, speakers would appear to be 
able to take all sorts of consideration into account when pronouncing linguistic expres- 
sions, and may for instance apply fewer optional assimilations in less frequent forms, or 
make words they use for the first time in their discourse longer than other words. In this 
connection, Hayes (1994) speaks of ‘the beast’ in man, suggesting that speakers can 
influence their speech behaviour from outside the language system, so to speak. 


Q71 The lengthening must be created during phonetic implementation. Languages 
vary in the extent to which they allow lengthening in sonorant segments before voiced 
obstruents in the coda, Arabic being extremely reluctant and English extremely liberal 
in this respect (Kluender et al. 1988). If we were to interpret the duration differences 
as resulting from phonological duration distinctions, English would have to be analysed 
as having a four-way quantity opposition, for instance. 


Q72 When the assimilation produces a segment which is indistinguishable from 
[f], it is reasonable to assume that the assimilation is phonological, i.e. that the fea- 
tures of the right-hand [f] have spread to the left-hand consonant. However, where the 
assimilation results in a gradual shift from [s] to [J], it is reasonable to describe it as 
the effect of phonetic implementation. In a gestural model, for instance, the tongue tip 
gesture for [s] could be shortened and/or that for [f] be extended forward in time. 


Chapter 9 


Q73 
1. The underlying tone pattern for both words is HL. 


2. The association in Tharaka must not start with the second TBU, but with the first. 


3. The Tharaka word for ‘way of releasing oneself quickly’ will be realized as 
[mdérékangérié |. The underlying representation 


mo e rek ang erie 
L H L H 


will by the Association CONVENTION (7a) be changed into 


mo'e rek ang eri€ 


Ch ie a eee 


L H L H 


and by (7b) into the phonological surface representation 


mo e rek ang erig 
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Q74 ! sats ; 
1. The word [amakosi] will be realized as [4makosi]. After the initial lexically 
specified association, the following structure obtains. 


ama kosi 


HL“ HE 


which by the association CONVENTION (7a and 7b) will be changed into 


ama kosi 


BL HL 


2. Part (c) of the association CONVENTION cannot apply, given that contour tones 
on short vowels are disallowed in Zulu. Thus, [amakosi] is ruled out, and the 
final L tone will therefore be deleted. 


Q75 The pitch peak and the final rise would both occur in John, creating a falling- 
rising contour, as shown. 


[has John ] 
| a 


L H*LH 


Q76 The word [ikpa] would have a single L. On the assumption that the OCP 
remains in force after the morphological operation of REDUPLICATION, the sequence of 
two L-tones in the reduplicated form would be reduced to a single one, which is asso- 
ciated with as many vowels as remain in the word. There would thus be no need for the 
TWIN SISTER CONVENTION to operate. 


Q77 
1... H,.L,/HL,.LH; LHE 


2. When LHL occurs with a single syllable, as in the case of the word for com- 
panion’, all three tones are pronounced. Unlike Etung, therefore, the language 
does not obey the No CROWDING Constraint in (9). 


3. njaha kpakali 
| /\ \V 
LHL L 


Q78 The fact that H spreads to all syllables, and never to only some, suggests that 
there is only one L-tone which is responsible for the low-toned syllables. When this 
L-tone is deleted, all TBUs with which it was associated will associate with H. 


Q79 
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akivara ataasiq akivaa 


HLH HLH HLH 


The direction of association is right-to-left, since the rightmost TBU always 
has tone. 


Yes, they are. If there is one more TBU, the HLH melody shifts right by one TBU. 
apirai apira ai 
| l III 
HLH HLH 
‘He asked them’ ‘Did he ask them? 


Since in the last two syllables there are four TBUs and three tones, the leftmost 
H must spread to an extra TBU on the left if it occurs in the same syllable. 


Chapter 10 
Q80 They all break the SYLLABLE Contact Law, 


Q81 


1. 


First, the words in (a) have one, two or three syllables, while those in (b) all 
have two. Second, the words in (a) have a variety of vowels, those in (b) [ai] and 
[ə], in that order. 


The consonants are preserved, but the vowels are not. 


The number of different consonants in the word. If there are at most two 
different consonants, there are three surface consonants in the disguise form, 
and if the number of different consonants in the word is three, there are four 
surface consonants in the disguise form. 


Use the template CVCVC if there are at most two different consonants, and 
CVCCVC if there are three. Associate the consonants to the C-slots, left-to- 
right, one-to-one, spreading the last. Associate [ai] and [9] to the V-slots. 


In Arabic, adjacent C-slots and adjacent V-slots associate with the same vowel 
or consonant, creating long vowels and geminates, but in this secret language 
adjacent C-slots are filled by different consonants if available. 

k 

Sse 


W r 
Lea 
cvs eve cre 
| | 
3 9 


a 

m 

CV 
/\ 


al 


人 


ai 


Q82 First, a V-slot associates with at most one vowel segment, which causes all 
diphthongs, which consist of two vowel segments, to be long. Second, association 


is left-to-right, one-to-one, only [iaa] can arise from the association of [ai] to VVV, 
never [aai]. 
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Q83 Together with its syllable node, a word-initial V-slot is deleted if associated with 
a high vowel after a V-slot. The high vowel re-associates leftward with the remaining 
V-slot, creating a diphthong, as shown below for the relevant structure in catald 
universal. The transcriptions with consonantal glides indicate the non-syllabic nature 
of the remaining vowel. 


o Oo o 
A | /\ 
CVV C Gye 
ee a ee 
Lawi laun 


Q84 Since Koya has maximally two moras, and WEIGHT-BY-POSITION, these structures 
must be as given below. 


o o o o o o o Cr 
Paf PER [> Dat: 

| | | | NAS | | | | 
p tk a m an ay wrk ‘wu b esko è 


Q85 The number of moras in the prefinal syllable. If there is one, the following 
schwa is deleted. Tera must have WEIGHT-BY-POSITION, since otherwise this generalization 
across short vowels-plus-coda consonants and long vowels could not be made. 


Q86 Lenakel appears to stress the last syllable if it contains a long vowel, and the 
penultimate otherwise. The language does not therefore equate coda consonants with 
the second halves of long vowels and thus has no WEIGHT-BY-POSITION. 


Q87 
1. Inthe Bakwiri word game, the segments of the second syllable are placed in the 
first, and vice versa. The location of tonal and moraic structure is unchanged, 
however. [ndak66] therefore becomes [kòndáá]. 


2. In [litinga] a long vowel occurs in the first syllable, but [liòßá] has two differ- 
ent vowels in the same syllable. 


3. The word for ‘door’ has a short (monomoraic) second syllable, and only one 
of the two vowels can be moraic. The game form must therefore either 
be [Baaliw] or [Baalj6]. The first of these must be excluded because the 
language has no codas. 


4. The glottal stops are there because the syllables concerned begin with a vowel. The 
glottal stops are more perceptible word-internally than at the beginning of a word. 


Q88 In Kinshingelo, the segment strings in the last two syllables of the word are 
reversed, while the tonal and moraic (timing) structure remain intact. Obviously, this 
operation cannot be carried out without reference to these elements of the phonolog- 
ical representation, which must for that reason exist. 
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Q89 
1. All syllables that are not stem-final are short, regardless of whether they were 
short in the stem, as in b, d, e, or long, as in a, c, e, f. 


2. The vowel in the stem-final syllable is long if its syllable has no coda, but short 
if it does. 


3. Ina moraic representation we can say that stem-final syllables have two moras. 
The adjustment of the vowel duration then follows from the presence of the 
stem-final consonant. For the word-final syllable, we can either assume these 
are obligatorily trimoraic, forcing the vowel to be long, or that they obligatorily 
have a long vowel, in which case the final consonant is non-moraic. Assuming 
the first option, here are the structures for ‘house’: 


[% ERR RR 


aj +an a h 


With a CV-tier, the equivalence of the rhymes in the stem-final syllables is less 
easily expressed, because the first occurrence would be VC and the second VV. 
This case represents the same type of argument against the CV-tier as the one 
given in the text on the basis of compensatory lengthening. 


Q90 
1. Ifthe BP rhyme is maximally bimoraic and [j] is ambisyllabic, this consonant 


will occupy the second mora of the syllable it closes, meaning that there is no 
room for the second element of a diphthong. 


2. Weassume that, as in English, foot-internal syllable-initial consonants are made 
ambisyllabic, and thus contrast structurally with foot-initial consonants. 


3. We can then say that vowels nasalize before nasal consonants within their rhyme. 
4. This solution accounts for the fact that all vowels nasalize before [j], because 
it is ambisyllabic regardless of whether it is foot-initial or foot-internal. 


Q91 An SPE-style account that does not refer to the syllable would have to say that 


[COR, + approx] is [一 cons] in the context before a word boundary or before a consonant, 
like (1) 


COR 一 [-cons] / Phd 
+approx = 


The conjunction either a consonant or a word boundary is unnecessary if the syllable 
coda can be referred to. 


Q92 In the words in I and III, the [r] is ambisyllabic. In I this is because it occurs 
internally with in a foot, opening a weak syllable (RIGHT capture), while in III it is 
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because the next word is vowel-initial (taison). It is ambisyllabicity that thus explains 
the occurrence of the flap. 


Chapter 11 


Q93 
1. It is the [coronat] node that spreads to the PLACE node of a preceding 
[t,d,n,l] in English coronar ASSIMILATION. 


2. It is the PLACE node that spreads from the consonant to the SUPRALAR node of a 
preceding nasal in Hindi. 


Q94 The picture you end up with should look more or less as shown. 


root root 


LAR 


SUPRALAR 


PLACE PLACE 


Q95 You cannot, because the PLACE tier and the roor tier are not adjacent, and hence 
do not define a plane. 


Q96 Natural feature classes are not phonetically abstract to the extent that the nodes 
in a feature tree can be defined in phonetic terms. For instance, the laryngeal node 
could phonetically be defined as ‘activity in the larynx’, the surpralaryngeal node as 


‘activity in the vocal tract’, etc. 


Q97 English PLACE assimitation can be stated in the display format as follows: 


SUPRALAR 


PLACE 


SD: CORONAL [一 cont] 
—cont 
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Q98 
1. In order to account for the Yorkshire English and Durham English assimilation 
facts, we could assume the SPE rules in (1) and (2). 
(1) YORKSHIRE VOICELESSNESS ASSIMILATION [—son] 一 [—voice]/__ # # [— voice] 
(2) DURHAM VOICING ASSIMILATION [一 Son] 一 [+voice]/__ ##[+ voice] 


2. Given that the feature [ — voice] needs to be referred to in the structural change 
and the structural description of YORKSHIRE VOICELESSNESS ASSIMILATION, it cannot be 
left unspecified and be filled in only at the end of the derivation. 


Q99 Two ordered rules are necessary to derive the correct Klamath output forms. 
The first rule should spread the SUPRALARYNGEAL node of a lateral consonant to a pre- 
ceding [n] or [I]. The second rule should delink the supralaryngeal node of the lateral 
consonant, which, if it is specified as [—voice] or as [ + constricted], will leave behind 
only its laryngeal specification, causing it to end up as [h] or [?]. The rules can be 
formalized as (1) and (2) below. 


(1) LAR 
x SUPRA 
SD: CORONAL [+lat] 
—cont 
(2) LAR 
SUPRALAR 
SD: [+lat] 


We assume that in the case of [nl] rule (1) creates a geminate structure, as shown 


C C 
ROOT 
LAR 
SUPRALAR 


Rule (2) cannot apply to this structure, which will be supplied with the feature [十 voice] 
by default. 


KO 
Q100 ce . 

1. The full representation for the four types of initial plosives is the following: 
voiced voiceless voiceless preglottalized 

aspirated voiced 
+voice —voice —voice 十 Voice 
一 Spread 一 Spread 十 Spread 一 Spread 
—constr —constr —constr +constr 

2. Assuming radical underspecification, the voiced plosives could be character- 
ized by the presence of the feature [十 voice] in the underlying representation, 
the voiceless aspirated plosives by the feature [十 Spread] and the preglottalized 
voiced plosives by the feature combination [十 Voice, +constr]. The features 
[— voice], [—spread] and [一 constr] could then be left unspecified, to be filled 
in by default rules. This means that the voiceless unaspirated plosives are left 
unspecified underlyingly. 

3. Coda plosives can be left unspecified, as their feature values can be provided 
by the default rules. 

Chapter 12 
Q101 

1. NASALIZATION in Sundanese should transfer the nasality of a nasal consonant to 
any following vowel, but is blocked by any consonant, except when this is a 
laryngeal segment. 

2. Ifthe rule is formulated such that it spreads [+ nas] from a nasal consonant to 
the supRALARYNGEAL node of a following vowel (V-position), it will be blocked by 
any segment that has a supralaryngeal specification. Since [h] and [?] do not 
have a SUPRALARYNGEAL node, these segments are invisible for the spreading 
[+nas], and their transparancy is straighforwardly accounted for. 

Q102 

1. One default rule needs to assign [— voice] to a stem-initial obstruent and one 
needs to assign [+ voice] to an intervocalic obstruent. 

2. The two [+voice]-melodies are [一 voice] and [十 voice] [一 voice]. Similar to 
the avoidance of the first TBU at the start of the tonal association in Kikuyu in 
Chapter 9, the melodies cannot associate to a stem-initial obstruent. The 
features must not spread, because a [ — voice] melody would produce ungram- 
matical forms like *[itupi] and *[titupi] if it did. 

3. Whereas [udodo] does not have an underlying [+ voice] melody, [adapigo] is 
specified for a [十 voice][ 一 voice] melody. 

4. Underlying form /iTuPi/ /T3T3KaKe/ 


Melody association [一 voice] [+voice][— voice] 
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/ituPi/ /TadakaKe/ 
Default rules [itubi] [tadakage] 


5. The form is not ungrammatical, because the second consonant in [konopio] is 
a sonorant and therefore not a [+voice]-bearing segment. The underlying 
[—voice] melody, skipping the stem-initial obstruent, will therefore be associ- 
ated to the third consonant. 


Q103 Yes, if there is only one lexical ‘floating feature (p. 175), it must either be on the 
left or on the right of any occurrence of the same feature linked to a segment. 


Q104 

1. In the Finnish word game Siansaksa, the initial (C)V-sequences of two 
words are inverted. BACK HARMONY applies so as to spread the feature [+ back] 
from left to right through the word, whereby the vowels [i] and [e] are trans- 
parent, neither undergoing nor imposing BACK HARMONY. For instance, exchang- 
ing [sa] and [he] in [saksalaisia hety:tet:i:n] initially gives [he]-[ksalaisia] 
and [sa]-[ty:tet:i:n], which pack Harmony turns into [hekseleisie 
satu:tet:i:n]. 


2. The analysis accounts for the fact the fact that the first word in the game form 
in (a) is not *[hæsælæisiæ], because only the initial (C)V sequence is inverted 
and not the entire first syllable. 


3. The analysis accounts for the fact that the second from in (a) is not 
*[saty:tet:i:n], because [a], being a back vowel in Finnish, will trigger Back HAR- 
mony, changing [y] into [u]. 


4. Because [i] is transparent, it neither undergoes nor imposes BACK HARMONY, 
which is why, in [hitsansa], the original back vowels remain back. 


Q105 Complex segments like *[px] or*[dņ] would require more than one specifica- 
tion for both place and manner. Apparently, a segment cannot be a complex place seg- 
ment and a manner-contour segment at the same time. Indeed, such representations 


would raise the question why both manner phases should not occur at both places of 
articulation, to give *[pfkx]. 


Q106 The coronal affricate [tł] does not undergo DELATERALIZATION, which implies 
that the lateral release of the affricate is not identical to a simple lateral. If [t4], instead 
of being a single segment, were a sequence of two segments, DELATERALIZATION would be 
expected to apply to [t#ilé]. 


Q107 The first consonant and the first vowel of a verb stem are reduplicated, as in 
[fafja], derived from [fja]. Since [fefle] and [kpokplo] are derived from the verbs 
stems [fle] and [kplo], respectively, it must be the case that while [fl] is a cluster, [kp] is a 
single consonant, a complex place segment. Similarly, [ts] is a single segment in 


[tsitsi], from [tsi], because if [ts] were a cluster, the reduplicated form would be 
[titsi]. 
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If the two suffixes did not have different underlying forms, they would be real- 
ized identically in their unassimilated form in the (a) examples. 


2. The dental consonants in the third column are [coRoNAL + ant, + distr], the alve- 
olar ones in the fourth column are [coronaL,+ant,—distr], while the pala- 
toalveolar consonants in the last column are [coronaL,— ant, + distr]. 


3. It is impossible to, account for coronal HARMONY by leftward spreading of the 
[coronat] node to consonants unspecified for [coronat], because the transpar- 
ent coronal consonants would be incorrectly assimilated, while the consonants 
that have a [coronat] node would be incorrectly left unassimilated as well as 
incorrectly blocking the spreading of the [coroNAL] node. 


4. If it is assumed that laterals and [tdb] are unspecified for [coronar] under- 
lyingly, they will get a [coronar] node with the features [+ ant, —distr] by a default 
rule. Before that happens, CORONAL HARMONY can spread the [coronat] node of the 
rightmost coronal consonant to preceding consonants that have a [coronat] 
node, delinking these nodes and spreading to the next consonant on the left if 
there is one. 


Chapter 13 


Q109 Main stress in Passamaquoddy is on the penultimate syllable if that syllable is 
heavy (i.e., contains a full vowel), as in item (b), and if it is not, main stress is on the 
antepenultimate syllable provided it is heavy, as in item (c). If neither of these syllables 
is heavy, as in items (a), (d) and (e), main stress is on the penultimate or antepenultimate 
syllable, depending on which of these two is separated by an odd number of syllables 
from the rightmost preceding full-vowelled syllable. Thus, in [mekanutasdpanik_], item 
(a), main stress is on the antepenult, because an odd number of syllables, one, inter- 
venes between this syllable and the rightmost preceding heavy syllable [nu], whereas 
in [elakilakapan], item (d), main stress is on the penult, which is separated by an odd 
number of syllables from the rightmost preceding heavy syllable [ki]. If the word does 
not contain any heavy syllables at all, main stress is on the penult or the antepenult, 
whichever is separated by an odd number of syllables from the beginning of the word. 
Thus, in item (e), main stress must be on the antepenult, in order to be separated by 
an odd number of syllables from the left word-edge, three in this case. 


Q110 
1. Input forms which show that the main stress is not invariably located at some fixed 
distance from a word edge, e.g. always the first, or always the second, or always the 
last, etc., but varies between first and second, or between last and penultimate, 
whereby the edgemost syllables with main stress have long vowels and/or are 
closed. In the next section, it will be seen that the first or last syllable of the word 
may be extrametrical, so that ‘first and second’ may in fact be‘second and third’, etc. 


2. If the language learning child only comes across words in which the main 
stress is located within a two-syllable window at one word edge, she has no 
reason to assume unbounded stress, but if additionally she comes across 
words with the main stress outside that window, i.e. on the third syllable from 


Q111 
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the same edge or beyond, stress must be unbounded. In the next section, we 
will see that, due to extrametricality, the two-syllable window may in fact be a 
three-syllable window. 


The stress pattern of Icelandic can be accounted for by constructing iterative 


QI-ld feet from right to left, followed by the construction of a left-dominant unbounded 
word tree, as illustrated for 'höfðingjá and ‘biografia. 


höf 
h 
s 


ding ja bi 0 gra fi a 
h ] l l 1 1 l 
s w s w w 


Q112 


Se ae 


The examples show that main stress is on the first syllable if it is heavy and on 


the second if it is not, and a QS foot is thus required. By looking at a word which starts 
with two light syllables, like [ko.j6.90], we can determine that this foot must be right- 
dominant, for if it was left-dominant, main stress would wrongly be predicted on the 
first syllable in that word. A QS-rd foot is therefore required. Since there is no second- 
ary stress, feet must be assigned non-iteratively. 


Chapter 14 


Q113 
1; 


2 


Q114 


The Turkish stress system exemplified by these words must be QS, for other- 
wise stress would be on a fixed syllable. 


The fact that stress is never on the final syllable can be accounted for by 
assuming final syllable extrametricality. 


. Inorder to determine whether a moraic trochee or an iamb is needed to account 


for the Turkish non-final stress pattern, words with light antepenultimate and 
penultimate syllables should be considered, like tornavida and çikolata. 
Assuming final syllable extrametricality, a moraic trochee would incorrectly 
locate the stress on the antepenult, *torndvida, *çikólata. By contrast, an iamb will 
be correctly stress the penultimate syllable under the same assumption of final 
extrametricality. 


To explain the existence of both cornice and cdlice, final extrametricality could 
be assumed to be a lexical property in Italian. If the final syllable (or mora) of 
calice is extrametrical, but not the final syllable of cornice, the construction of 
a moraic trochee at the right word edge will correctly produce stress on the 
penult in cornice and on the antepenult in calice. 
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2. Main stress on a final light syllable can be accounted for, if, like final extramet- 
ricality, final mora catalexis is a lexical property. Words like cittd and colibri are 
specified as having a final catalectic mora, and the construction of a moraic 
trochee at the right word edge will, again, correctly produce stress on the final 
syllables of these words. 


Q115 Hayes’ (1995) analysis of Turkish non-final stress (moraic trochees plus final 
foot extrametricality under clash) would wrongly predict final stress in Indiyanapdlis 
and in Us kü dar, given that the final foot will not be made extrametrical, as there is no 
clash, as shown below. 


In di ya na po lis 


mm m m m m mm 
i) & 4 & . &) 
Ys ky dar 
mm m mm 
(x.) (x.) 


Q116 The domain of the obligatory occurrence of [p] is the foot. That is, if the nasal 
and the dorsal consonant are within the same foot, as in increment, the nasal is oblig- 
atorily realized as [n], whereas if they are not, as in the verb in (créase) or the noun 
(in)(créase), the nasal may be realized as either [n] or [p]. 


Q117 Dutch [h] is not realized in foot-internal position. The fact that [h] is not real- 
ized in the word marihudna shows that the first three syllables of [mat.ri.u.ai.na] 
form a single foot. 


Q118 


/nankikirikirimpaji/ FrBin | RuTyp PARSE-O 
(T) ad Fr, L) if R, , R) 
[(pánki) ( ) (riki) #00000 
(rimpa) ji] #000! 
#0 


[(nán) (kici) * #000000 
(kirim)(paji)] #0000 
#00 


Te S | 


= (kirim) #00000 
(paji)] #00 
ye )(riki)rim #00000 
(paji)] #ooo! 
[(nanki)ri ( ri (kirim) #00000 
(páji)] #00 
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Q119 The first form of each language shows that NoNFINALITY must dominate the con- 
straint that requires a word to end in a foot, At (w, R, Fr, R). The second form of each 
language shows that the constraint requiring words to start with a foot is less important 
than the one requiring words to end in a foot. Therefore, At (w, R, Fr, R) must dominate 
At (, R, Fr, L). Finally, the last form of each language shows that QS must dominate 
PARSE-O in Latin, but that for Macedonian Parse-0 must dominate QS. 


FrBin | Parse-o | AL 
(o, L) 


Q120 


/tata/ 


en oe 
eis 
cae 


NonF | FTBIN | Parse-o | AL 
(w, L) 


Eon ae es ae a 


mala(nolé)ju 


* 


oa 
一 
pa 


ei 

tay 
=x 
— 
i] 


z 

© 

zZ 
Ti 
B 
= 
y 

z 
az] 

> 

hd 

T 

Q 
ee 

= 
ef 
Z 

工 
~ 
za 
=x 
~~ 
pa 


a ae ae 
ob | 
OTER S 


(tipò)(hó)re 


Q121 
1. First, RaTYpe (T) 》RHTYPE (I). The four-syllable word [asikada] shows that 
RuType (T) must domipate RuTyre (I) in order to get a trochaic pattern, 
instead of an iambic [asikidá]. Second, FrBIN » PARSE-O. Footing a trisyllabic 
word like [kirika] will produce one stress, with one syllable left unfooted, 
showing that FTBI must crucially rank above PARSE-O, in order to get [kirika] 
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instead of [kirika]. Third, At (Ft, R, œ, R) » Ar (Ft, L, @,L). Assuming a trochee, 
the same trisyllabic [kirika] shows that the foot must be right-aligned with a 
word, not left-aligned. In the lefthand column the correct rankings are shown, 
and the incorrect rankings are shown on the right. 


/asikada/ RHTYPE(T) 
= 


oman | 
— 


os *(asi)(kadá) 


ws e 
ST A 


de | [eo 


hike’ [neen] arbon | 


w ki(rika) 
/kirika/ AL (Ft, L,w,L)| Ar(FT,R,w,R) 


Q122 In order to get final stress/H-tone in [apataka], we need to assume iambs rather 
than trochees, that is, RaTYPE (I) » RuType (T), and also that footing needs to be iterative, 
that is, Parse-o » At (Fr, L, œ, L). Trisyllabic [if6ci] shows that a word must start with a 
foot rather than end with one, that is, AL (Fr, L, w, L) » At (Fr, R,@, R), and that no mono- 
syllabic light feet are allowed, giving FrBIN » Parse-o. Finally, in order to get final main 
stress/H-tone, we need to assume that Auien (Hean, R,@, R) (H/R) dominates Arien (HEAD, 
L, œ, L) (H/L). The constraint ranking is illustrated for [ifoci] and [apataka]. 
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(fó (ci) 
væ (if6)ci 


/apataka/ FTBIN | Parse-o 
(apa) (taka) 
(apa)taka + # 


apa(taka) 


oR is Silo 
usr (apa)(taka) pia eee 


Chapter 15 


Q123 


1. Binary must rank below AtiGNFOC, which makes the violation of Binary 
unavoidable. 


2. When liburua is the focus constituent, a p-boundary must appear before it. 
Lower-ranking Binary will now make itself felt again by forcing Amaiari and 
amuden to occur in the same ọ, despite the requirement of lowest-ranking 
ALIGNXP that there must be a 中 -boundary before the XP amumen liburua. 
With liburua as the focus constituent, the prosodic structure will be „(Amaiari 
amumen), .(liburua), ¿(emon dotzo). 


Q124 The -structures are 


(1) (I was explaining), ,(to the students), ,(the problem), (of the double 
negation), o(in English), 


(2) (I was explaining), ,(to the students), (the problem), 


Example (2) is ungrammatical because a condition on Heavy NP SHIFT is that the 


NP that is moved must contain minimally two gs, and in (2) it contains only one, 
the problem. 


Q125 In This is represented in (6), the adverbial phrase in six can be included with its 
verb represented in the same 中 ,a case of restructuring, because the next constituent is 
a single phonological word in six, where in is a function word and as such a weak syl- 
lable. As a result, the stress pattern of répresénted changes to répresénted before six in 
the same 中 . However, in This is represented in (6a), the adverbial in six a consists of 
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two phonological words, in six and a. As a result, it is not included in the same ¢ as 
represented, so that the RHYTHM RULE cannot apply. 


Q126 The domain of this domain span generalization is the wœ. In the second 
column, [r] appears as the first segment of a lexical category N or V. As a result, it 
will begin a new 0. 


Q127 Inthe examples in (1), the vowel is the first segment of a lexical category N or 
Adj. As a result, it begins an w, causing the preceding [s] to be in a different @. 
Incidentally, this domain juncture rule will require that the two ws occur in the 
same IP. 


Q128 The occurrence of [an] is purely determined by the presence of a following 
vowel, regardless of the morphological status of the word it occurs in, as shown by an 
apple, an excellent pear, an awfully nice apricot, an I-don’t-know look. 


Q129 Syntax-dependent alternations can have exceptions in Hayes’ precompilation 
theory, because this theory places forms whose occurrence is sensitive to syntactic 
information in the lexicon, just like forms that are sensitive to morphological infor- 
mation. As such, the syntax-sensitive alternation is in principle subject to the same 
exceptions as alternations that are sensitive to morphology. 
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The language index below was modelled on the one in Laver (1994) and composed with the help of Grimes 


(1988). 
Language 


Xu 

Akan 
Amharic 
Angaatiha 


Arabela 
Arabic 


Cairene Arabic 
Gulf Arabic 
Tetouan Arabic 
Arawak 


Bakairi 
Bakwiri 
Balantak 
Basque 
Bengali 


Brao 


Bura 
Burmese 
Catalan 
Cavineña 


Chipewyan 


Chuave 


Ciyao 
Corsican 
Creek 
Czech 
Danish 


Language index 


Family 


Khoisan 
Niger-Kordofanian, Kwa 
Semitic 

Trans-New Guinea, Angan 


Zaparoan 
Semitic 


Arawakan 


Carib 

Niger-Kordofanian, Bantu 
Austronesian 

Isolate 

Indo-European, 
Indo-Aryan 
Austro-Asiatic, 
Brao-Kravet 

Afro-Asiatic 
Sino-Tibetan 
Indo-European, Romance 
Tacanan 

Athabaskan 


Trans-New Guinea, 
Chimbu 
Niger-Kordofanian, Bantu 
Indo-European, Romance 
Muskogean 
Indo-European, Slavic 
Indo-European, 
Germanic 


Geographical area 


Namibia, Angola 

Ghana (South) 

Ethiopia, Egypt 

Papua New Guinea (Moroba Province, 
Menyamya District) 

Peru (Arabela River) 

Saudi Arabia, Algeria, Bahrain, Chad, 
Egypt, Ethiopia, Iraq, Jordan, Kuwait, 
Lebanon, Libya, Morocco, Oman, 
People’s Democratic Republic of 
Yemen, Qatar, Somalia, Sudan, Syria, 
Tunisia, United Arab Emirates, Yemen, 
Arabic Republic 

Egypt (Cairo) 

United Arab Emirates, Saudi Arabia 
Morocco (Tetouan) 

Surinam, French Guiana, Guyana, 
Venezuela 

Brazil (Mato Grosso) 

Cameroon (South) 

Indonesia (East central Sulawesi) 
Spain, France 

Bangladesh, India, Singapore, 

United Arab Emirates 

Laos, Cambodia 


Nigeria 

Burma, Bangladesh 

Spain (Northeast) 

Bolivia (Beni River) 

Canada (North-west Territories, 
Saskatchewan) 

Papua New Guinea (Chimbu Province) 


Malawi, Tanzania, Mozambique 
France (Corsica) 

USA (Oklahoma, Alabama) 
Czech Republic, Slovakia 
Denmark 


Page numbers 


14, 24, 27, 31 
69, 174-175 
75, 141 

25 


24 
13,31 
139-141 


190, 202 
9 

192 

11 


31,173 
148, 179 
56 

221 

74 


168 


31 

11,25, 145 
215 

12 


27,31 


126 
64,74 
216 
25,31 
72 


Language Family 

Dinka Nilo-Saharan 

Dizi Afro-Asiatic 

Dutch Indo-European, Germanic 


Alphen Dutch 
Kaatsheuvel 
Dutch 
Sittard Dutch 
Southern 
Dutch 
Utrecht Dutch 
Wehl Dutch 


English Indo-European, Germanic 


American 
English 
British English 


Canadian 
English 

Durham 
English 

London English 

Northern Irish 
English 

Old English 


Scottish 
English 
Singapore 
English 
Yorkshire 
English 
Estonian 
Etsako 
Etung 


Uralic, Balto-Finnic 
Niger-Kordofanian, Kwa 
Niger-Kordofanian, Bantu 
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Geographical area 


Sudan 

Ethiopia 

Netherlands, Belgium, France, 
Surinam, Netherlands Antilles 


Netherlands (Alphen en Riel) 
Netherlands (Kaatsheuvel) 


Netherlands (Sittard) 
Netherlands (South), Belgium 


Netherlands (Utrecht) 
Netherlands (Wehl) 

Australia, Canada, India, Ireland, 
Pakistan, Singapore, South Africa, 


UK, USA, and many other countries 
USA 


UK 


Canada 
UK (Durham) 


UK (London) 
UK (Northern Ireland) 


Extinct 

UK (Scotland) 

Singapore 

UK (Yorkshire) 
Estonia, Latvia 


Nigeria 
Nigeria, Cameroon 


Page numbers 
4,5, 25, 131, 147 
31 

9, 10, 11, 13,20 
23, 25, 28, 29, 

31, 37, 38, 39, 

47, 49, 54, 55, 

60, 61,67, 70, 

74, 79-81, 83, 
89-94, 96-106, 
107-109, 111-112, 
115, 117, 137, 138, 


103, 104 
89, 90, 91 


96, 99-105 
11 


103, 104 

74, 103, 105 

1-16, 18, 20, 23, 
25, 26, 28, 31, 32, 
35-44, 49-55 

58, 62-69, 80, 81, 
83, 85-88, 90, 

108, 110-112, 115, 
117,119, 120, 129, 
137, 139, 143, 145, 
149-154, 159-163, 
166, 168, 176, 177, 
183, 188, 207,210, 
218, 219, 221-224, 
230, 231 

7, 14, 25, 63 


4,7, 13, 14, 25, 49, 
53, 54, 68, 87, 119, 
120, 134, 149, 154 
168, 183 

86, 87,90 

168 


161 
115 


50, 81, 151, 

9, 65, 66, 111, 141 
52,53 

168 

25 

127, 130 


124-127, 129, 130, 
132, 170, 173, 174 
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Language 


Ewe 
Fijian 
Finnish 
French 


Frisian 
Gaelic 
Garawa 


German 


Gothic 
Greek 
Haida 
Hausa 


Hawaiian 


Hindi 
Hixkaryana 


Hopi 
Icelandic 
Igbo 
Imonda 


Indonesian 
Italian 


Japanese 


Karen 
Kayardild 


Kikuyu 
Kilivila 
Kipare 
Kisanga 
Klamath 
Konjo 
Korean 
Koya 
Latin 


Lenakel 
Luganda 
Maba 
Macedonian 


Family 


Niger-Kordofanian, Kwa 
Austronesian 

Uralic, Balto-Finnic 
Indo-European, Romance 


Indo-European, Germanic 
Indo-European, Celtic 
Australian, Garawan 


Indo-European, Germanic 


Indo-European, Germanic 
Indo-European, Hellenic 
Na-Dene 

Afro-Asiatic 


Austronesian 


Indo-European, 
Carib 


Uto-Aztecan 
Indo-European, Germanic 
Niger-Kordofanian, Kwa 
Trans-New Guinea, Waris 


Austronesian 
Indo-European, Romance 


Isolate 


Sino-Tibetan 

Australian, 
Non-Pama-Nyungan 
Niger-Kordofanian, Bantu 
Austronesian 
Niger-Kordofanian, Bantu 
Niger-Kordofanian, Bantu 
Penutian 

Austronesian 

Isolate 

Dravidian 
Indo-European, Romance 


Austronesian 
Niger-Korofanian, Bantu 
Nilo-Saharan 
Indo-European, Slavic 


Geographical area 


Ghana, Togo 

Fiji 

Finland 

France, Belgium, Canada, 
Luxemburg, Switzerland, worldwide 
as a lingua franca 


Netherlands (Friesland) 

UK (Outer Hebrides) 

Australia (Borroloola, Northern 
Territory and Doomadgee, 
Queensland) 

Germany, Austria, Switzerland, 
Liechtenstein, Poland, Romania, 
Ukraine, Russia, Brazil, Argentina 


Extinct 

Greece, Cyprus 

USA (Alaska), Canada (BC) 
Nigeria, Benin, Burkina Faso, 
Cameroon, Chad, Ghana, Niger, 
Sudan, Togo 

USA (Hawai'i) 


India, Nepal 

Brazil (Amazonas, upper Nhamunda 
River to Mapuera and Jatapa rivers) 
USA (Arizona) 

Iceland, USA 

Nigeria 

Papua New Guinea (Imonda), 
Indonesia (Irian Jaya) 

Indonesia (Java, Bali) 

Italy, USA, South America 


Japan, Taiwan, USA, Brazil, Thailand 


Burma 
Australia (South Wellesley Island) 


Kenya 

Papua New Guinea 
Tanzania 

Congo (Shaba) 

USA (Oregon) 

Indonesia (East Sulawesi) 
South Korea, North Korea 
India 

Extinct 


Vanuatu 

Uganda 

Chad (East), Sudan 
Macedonia 


Page numbers 


64, 174 


12, 25, 85, 174-176 
21,.2552,5 
8-11, 21, 25, 34, 43, 
80, 82, 88, 91, 113 

114, 117, 120, 138, 
142, 155, 180, 230 

66, 143, 162, 163 

141 

193, 198, 200, 212-216 


2,5 ,9, 10, 12-14, 25, 
26, 28, 35, 37, 54, 64, 
69, 112, 113, 137, 138, 
143, 144, 167, 176, 
177, 203, 224, 226, 231 
144 

64, 69, 81, 182, 226 
26, 27 

31, 230, 231 


5, 12,24,25,26, 28, 31, 
34-38,44,145 

5, 10, 158-160, 165 
31, 196-201, 204, 
205, 214, 216 

145, 146, 197 

197 

26,49 

12,72 


34, 37,44, 45 

8, 12, 25, 70, 73, 146, 
149, 180, 203, 218, 
226, 228 

12, 22, 25, 31, 38, 39, 
53, 138, 181, 187 
5,62 

68 


126, 127 

68 

132 

148 

24, 25, 163, 168 
34, 37, 44, 45, 46 
8, 49, 50, 113 

147 

73, 180, 187, 188, 189, 
195, 196, 198, 202, 
208, 215, 216, 230 
148 

75, 136, 144 

24 

215 


Language 


Malayalam 
Mandarin 
Margi 
Mauritius 
Creole French 
Mende 
Mura 
Mwera 
Nama 
Nambakaengo 
Norwegian 
Orochen 
Oromo 
Passamaquoddy 


Pawaia 


Polish 
Portuguese 
Brazilian 
Portuguese 
European 
Portuguese 
Proto-Indo- 
European 
Provençal 
Rotokas 
Russian 
Sanskrit 
Serbocroat 
Shona 


Sinhalese 
Spanish 


Colombian 
Spanish 
New World 
Spanish 
Sundanese 
Swedish 
Tagalog 

Tahltan 
Tamil 
Tera 
Thai 
Tharaka 
Tlingit 
Toda 
Tolitoli 
Tonkawa 
Tsou 
Turkish 
Venda 
Vietnamese 
Welsh 
Weri 


Family 


Dravidian 
Sino-Tibetan, Chinese 
Afro-Asiatic 
French-based creole 


Niger-Kordofanian, Mande 
Afro-Asiatic 
Niger-Kordofanian, Bantu 
Khoisan 

East Papuan 
Indo-European, Germanic 
Altaic, Tungusic 
Afro-Asiatic 

Algonquian 


Trans-New Guinea, 
Teberen-Pawaian 
Indo-European, Slavic 
Indo-European, Romance 


reconstructed 


Indo-European, Romance 
East Papuan 
Indo-European, Slavic 
Indo-European, Indo-Aryan 
Indo-European, Slavic 
Niger-Kordofanian, Bantu 


Indo-European, Indo-Aryan 
Indo-European, Romance 


Austronesian 
Indo-European, Germanic 
Austronesian 

Na-Dene 

Dravidian 

Afro-Asiatic 

Daic, Tai 
Niger-Kordofanian, Bantu 
Na-Dene 

Dravidian 

Austronesian 

Gulf 

Austronesian 

Altaic 

Niger-Kordofanian, Bantu 
Austro-Asiatic 
Indo-European, Celtic 
Trans-New Guinea, 
Kunimaipa 


Geographical area 


India 
China 
Nigeria 
Mauritius 


Sierra Leone 
Cameroon 

Tanzania (Southeast) 
Namibia 

Solomon Islands 
Norway 

China (Heilongjiang) 
Ethiopia 

Canada, USA (Maine, 
New Brunswick) 
Papua New Guinea 


Poland 
Portugal, Brazil, USA 
Brazil, USA 


Portugal 


France, Italy, Monaco 
Papua New Guinea 
Russia, former USSR, USA 
India 

Bosnia, Croatia, Serbia 
Zimbabwe, Mozambique, 
Zambia 

Sri Lanka 

Spain, South America 


Colombia 
South America 


Java, Bali, Indonesia 
Sweden 

Philippines 

Canada (BC) 

India 

Nigeria 

Thailand 

Kenya 

USA (Alaska), Canada (BC) 
India (Orissa, Tamil Nadu) 
Indonesia (Sulawesi) 

USA (Oklahoma) 

Taiwan 

Turkey, Germany, Cyprus 
South Africa, Zimbabwe 
Vietnam 

UK (Wales) 

Papua New Guinea 
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Page numbers 


136 
122, 144 
68,176 
43 


132, 170 


25, 177, 183 
9,11, 154 
154 


4,11 
192 


49 

24,26 

108, 170-171 
172 

183 

130 


10 

5, 11,255, 61, 117, 137, 
154, 161 

61 


161 


173 

49, 138 

149, 209, 210, 218 
177, 184 

136, 147 


64, 72, 81, 203, 204, 226 
64 

14,31 

12 

195, 198, 200, 216 
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Language 


West Greenlandic 
Wolof 

Xhosa 

Yoruba 

Yurakaré 

Zulu 


Family 


Eskimo-Aleut 
Niger-Kordofanian, West Atlantic 
Niger-Kordofanian, Bantu 
Niger-Kordofanian, Kwa 

Isolate 

Niger-Kordofanian, Bantu 


Geographical area 


Greenland 

Senegal 

South Africa 

Nigeria, Benin, Togo 

Bolivia 

South Africa, Malawi, Swaziland 


Page numbers 


215 
127 


subject index 


This subject index lists only the sections and chapters in which the subject concerned is dealt with fully, and 
does not list all pages on which the topic is mentioned. 


p see mora 
see phonological phrase 

w see phonological word 

g see syllable 

Absolute Slicing Hypothesis 9.2 
acquisition 2.5 

affix 2.2.1 

affricate 1.5.2, 1.6, 5.4.3 

Aitken’s Law 8.3.3, Q40 
allomorphs 4.3, 15.6.2 

allophone 4.2, 4.3, 8.2 

alveolar 1.5.1 

alveolar ridge 1.3.3 

alveopalatal 12.3.2 

ambisyllabic consonant 10.7, 10.7.1, 15.7 
American Structuralists 8.2 
anterior (feature) 5.5, 11.4.1 
approximant (consonant) 1.5.2 
approximant (feature) 5.4.1 
aspiration 1.2.2, 5.4.2 
assimilation 11.2, 11.5.1 
Association Convention 9.3, 9.3.1 
association line 9.3, 11.5 


ATR (feature) 5.5 
autosegment 9.1, ch. 9 
Autosegmental Phonology chs 9-12 


back (feature) 5.5, 11.4.1, 12.3.2 
back (of the tongue) 1.3.3, 1.4 
base 2.2.1 

bilabial 1.5.1 

binary (feature value) 5.3 
blade 1.3.3 

bleeding order 6.6.3 

bound morpheme 2.2.1 
boundary symbol 6:2, 15.2 
brace notation 6.3.2 

breathy voice 1.2.3 


catalexis 14.2 
category membership 2.2.1 
central (tongue body) 1.4 


clause 2.2.2 

click 1.8 

clitic 14.4, 15.6.1 

Clitic Group 15.6.1 

close (Vowels) 1.4 

cluster 2.4 

coda (of syllable) 2.4, 10.6 
compensatory lengthening 10.4.3 
complementary distribution 4.2 
complex (segment) 1.7, 12.3 
complex (word) 2.2.1 

complex place segment 12.3 
complexity (phonological) 2.5, 2.5.1, 2.5.2 
concord 2.3 

consonantal (feature) 5.4, 11.4 
consonantal place node 11.4.1, 12.3 
constraints 3.4.2, 3.5 

constricted glottis (feature) 5.4.2 
continuant (feature) 5.4.3, 11.4.3 
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